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Abstract. We consider a specific class of tree structures that can represent basic struc- 
tures in linguistics and computer science such as XML documents, parse trees, and tree- 
banks, namely, finite node-labeled sibling-ordered trees. We present axiomatizations of 
the monadic second-order logic (MSO), monadic transitive closure logic (FO(TC^)) and 
monadic least fixed-point logic (FO(LFP^)) theories of this class of structures. These logics 
can express important properties such as reachability. Using model-theoretic techniques, 
we show by a uniform argument that these axiomatizations are complete, i.e., each formula 
that is valid on all finite trees is provable using our axioms. As a backdrop to our positive 
results, on arbitrary structures, the logics that we study are known to be non-recursively 
axiomatizable. 



In this paper, we develop a uniform method for obtaining complete axiomatizations of 
fragments of MSO on finite trees. In particular, we obtain a complete axiomatization 
for MSO, FO(TC^), and FO(LFP^) on finite node-labeled sibling-ordered trees. We take 
inspiration from Kees Doets, who proposed in [12] complete axiomatizations of FO-theories 
in particular on the class of node-labeled finite trees without sibling-order (see Section [U 
where we discuss his work in more details). A similar result for FO on node-labeled finite 
trees with sibling order was shown in [2j in the context of model-theoretic syntax and in [9] 
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in the context of XML query languages. We use the signature of ^ and extend the set of 
axioms proposed there to match the richer syntax of the logics we consider. 

Finite trees are basic and ubiquitous structures that are of interest at least to math- 
ematicians, computer scientists (e.g. tree-structured documents) and linguists (e.g. parse 
trees). The logics we study are known to be very well-behaved on this particular class of 
structures and to have an interestingly high expressive power. In particular, they all allow 
to express reachability, but at the same time, they have the advantage of being decidable 
on trees. 

As XML documents are tree-structured data, our results are relevant to XML query 
languages. Declarative query languages fro both relational and XML data are based on 
logical languages. In [11] and [16], MSO and FO(TC^) have been proposed as a yardstick 
of expressivity of navigational query languages for XML. It is known that FO(LFP^) has 
the same expressive power as MSO on trees, but the translations between the two are non- 
trivial, and hence it is not clear whether an axiomatization for one language can be obtained 
from an axiomatization for the other language in any straightforward way. One important 
and well-studied problem for XML query languages, as well as for database query languages 
in general, is query optimization. Typically, a query can be expressed in many equivalent 
ways, and the execution time of a query depends strongly on the way it is expressed. A 
common approach to database query optimization is by means of a set of rewrite rules, 
allowing one to transform a query expression into another equivalent one, together with a 
cost model that predicts the execution time of a query expression on a given database [T]. 
In , a sound and complete set of rewrite rules for the XML path language Core XPath 
2.0 was obtained from a complete axiomatization of the first-order theory of finite trees, 
exploiting the fact that Core XPath 2.0 is expressively complete for first-order logic. We 
expect that the results we present here can be used in order to obtain sound and complete 
sets of rewrite rules for dialects of Core XPath that are expressively complete for FO(TC^) 
and for MSO, such as the ones presented in [71 lllj . 

In applications to computational linguistics, finite trees are used to represent the gram- 
matical structure of natural language sentences. In the context of model theoretic syntax, 
Rogers advocates in [27] the use of MSO in order to characterize derivation trees of context 
free grammars. Kepser also argues in [21] that MSO should be used in order to query 
treebanks. A treebank is a text corpus in which each sentence has been annotated with 
its syntactic structure (represented as a tree structure). In ^22j and [30j Kepser and Tiede 
propose to consider various transitive closure logics, among which FO(TC^), arguing that 
they constitute very natural formalisms from the logical point of view, allowing concise and 
intuitive phrasing of parse tree properties. 

The remainder of the paper is organized as follows: in Section [2] we present the concept 
of finite tree and the logics we are interested in together with their standard interpreta- 
tion. Section [3] merely state our three axiomatizations. In Section HI we introduce non 
standard semantics called Henkin semantics for which our axiomatizations are easily seen 
to be complete. We prove in detail the FO(LFP^) Henkin completeness proof. Section [5] 
introduces operations on Henkin structures: substructure formation and a general operation 
of Henkin structures combination. We obtain Feferman-Vaught theorems for this operation 
by means of Ehrenfeucht-Fraisse games. This section contains in particular the definitions 
and adequacy proofs of the Ehrenfeucht-Fraisse games that we also use there to prove our 
Feferman-Vaught theorems. In Section [U we prove real completeness (that is, on the more 
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restricted class of finite trees). For that purpose, we consider substructures of trees that we 
call forests and use the general operation discussed in Section [5] to combine a set of forests 
into one new forest. Our Feferman-Vaught theorems apply to such constructions and we 
use them in our main proof of completeness, showing that no formula of our language can 
distinguish Henkin models of our axioms from real finite trees. We also point out that 
every standard model of our axioms actually is a finite tree. Finally, we notice in Section [7| 
that a simplified version of our method can be used to show similar results for the class of 
node-labeled finite linear orders. 

2. Preliminaries 

2.1. Finite Trees. A tree is a partially ordered set with a unique element called the root 
and such that apart from the root, each element (or node) has one unique immediate 
predecessor. We are interested in finite node-labeled sibling-ordered trees: finite trees in 
which the children of each node are linearly ordered. Also, the nodes can be labeled by 
unary predicates. We will call these structures finite trees for short. 

Definition 2.1 (Finite tree). Let be a fixed finite set of unary predicate 

symbols. By a finite tree, we mean a finite structure 97t = (M, <, ^, Pi, . . . , P„), where 
(M, <) is a tree (with < the descendant relation) and ~< linearly orders the children of each 
node. 

2.2. Three Extensions of First-Order Logic. In this section, we introduce three exten- 
sions of FO: MSO, FO(TC^) and FO(LFP^). In the remainder of the paper (unless explicitly 
stated otherwise), we will always be working with a fixed purely relational vocabulary a (i.e. 
with no individual constant or function symbols) and hence, with cr-structures. We assume 
as usual that we have a countably infinite set of first-order variables. In the case of MSO 
and FO(LFP^), we also assume that we have a countably infinite set of set variables. The 
semantics defined in this section we will refer to as standard semantics and the associated 
structures, as standard structures. 

We first introduce monadic second order logic, MSO, which is the extension of first-order 
logic in which we can quantify over the subsets of the domain. 

Definition 2.2 (Syntax and semantics of MSO). Let At stand for a first-order atomic 
formula (of the form R{x)), x = y, or T), x a first-order variable and X a set variable. The 
set of MSO formulas is given by the following recursive definition: 

(p := At I Xx |(/?A'i/'|'/'V'i/'|'/'^'0l I I 3X if 

We use VX(/7 (resp. Vx(/5) as shorthand for (resp. -iBx-k/j). We define the quan- 

tifier depth of a MSO formula as the maximal number of first-order and second-order nested 
quantifiers. We interpret MSO formulas in first-order structures. Like for FO formulas, the 
truth of MSO formulas in UJT is defined modulo a valuation g of variables as objects. But 
here, we also have set variables, to which g assigns subsets of the domain. We let g[a/x\ 
be the assignment which differs from g only in assigning a to x (similarly for g'[^/X]). The 
truth of atomic formulas is defined by the usual FO clauses plus the following: 

9K, g \= Xx iff g{x) € g{X) for X a set variable 
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The truth of compound formulas is defined by induction, with the same clauses as in 
FO and an additional one: 

Wt,g\= 3Xip iff there is ^ C M such that M, g[A/X] \= ip 

The second logic we are interested in is monadic transitive closure logic, FO(TC^), which 
extends FO by closing it under the transitive closure of binary definable relations. 

Definition 2.3 (Syntax and semantics of FO(TC^)). Let At stand for a first-order atomic 
formula (of the form R{x)), x = y, oi T), u,v,x,y first-order variables and ip{x,y) a 
FO(TC^) formula (which, besides x and y, possibly contains other free variables). The set 
of FO(TC^) formulas is given by the following recursive definition: 

:= At \ Xx |(^A'0|(^V'^|(^^'0| I 3x (/p I \TCxyV{x, y)] {u, v) 

We use \/x(p as shorthand for -'3x-'ip. We define the quantifier depth of a FO(TC^) formula 
as the maximal number of nested first-order quantifiers and TC operators. We interpret 
FO(TC^) formulas in first-order structures. The notion of assignment and the truth of atomic 
formulas is defined as in FO. The truth of compound formulas is defined by induction, with 
the same clauses as in FO and an additional one: 

Tl,g\= [TC^yip]{u,v) 
iff 

for all A CM, a g{u) € A 
and for all a, 6 G M, a E A and dJl, g[a/x, b/y] |= (p{x, y) implies 6 G ^, 

then g{v) € A. 

Proposition 2.4. On standard structures, the following semantical clause for the TC op- 
erator is equivalent to the one given above: 

1= [TC^yip{x,y)]{u,v) 

there exist oi . . . a„ € M with g{u) = ai and g{v) = an 
and Tl,g \= 93(04, a^+i) for all < i < n 

Proof. Indeed, suppose there is a finite sequence of points ai . . . a„ such that g{u) = oi, 
g{v) = On, and for each i < n, Tl, g[x / Oi] y / Oi+i] \= ip{x,y). Then for any subset A closed 
under cp and containing ai , we can show by induction on the length of the sequence oi . . . a„ 
that On belongs to A. Now, on the other hand, suppose that there is no finite sequence like 
described above. To show that there is a subset A of the required form, we simply take 
A to be the set of all points that can be reached from u via (/? by a finite sequence. By 
assumption, v does not belong to this set and the set is closed under tp. □ 

Intuitively this means that for a formula of the form \JCxy'p]{u, v) to hold on a standard 
structure, there must he a finite 'V path" between the points that are named by the variables 
u and V. 

Finally we will also be interested in monadic least fixed-point logic FO(LFP^), which 

extends FO with set variables and an explicit monadic least fixed point operator. Consider 
a FO(LFP^)-formula (p{X, x) and a structure fUt together with a valuation g. This formula 
induces an operator taking a set A C dom{dJl) to the set {a : Wl, g[a/x, A/X] |= (p}. 
FO(LFP^) is concerned with least fixed points of such operators. If (p is positive in X 
(a formula is positive in X whenever X only occurs in the scope of an even number of 
negations), the operator is monotone (i.e. X C.Y implies F^{X) C F^{Y)). Monotone 
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operators always have a least fixed point LFP{F) = {^{X\F{X) C X} (defined as the 
intersection of all their prefixed points). 

Definition 2.5 (Syntax and semantics of FO(LFP^)). Let At stand for a first-order atomic 
formula (of the form R{x)), x = y, oiT), X a set variable, x, y FO-variables, ip, ^ FO(LFP^)- 
formulas and ip{x,X) a FO(LFP^)-formula positive in X (besides x and X, ip{x,X) possibly 
contains other free variables). The set of FO(LFP^) formulas is given by the following 
recursive definition: 

il; := At \ Xy \ ijj A ^ \ ip V ^ \ il^ ^ ^ \ ^ip \ 3x il^ \ [LFPxx'fix, X)]y 

We use yxip as shorthand for -'Bx-'tp. We define the quantifier depth of a FO(LFP^)-formula 
as the maximal number of nested first-order quantifiers and LFP operators. Again, we can 
interpret FO(LFP^)-formulas in first-order structures. The notion of assignment and the 
truth of atomic formulas are defined similarly as in the MSO case. The truth of compound 
formulas is defined by induction, with the same clauses as in FO and an additional one: 

931,5 N [LFPxMy 
iff 

for ah A C dom{Tl), if for all a E dom{Tl), Tl,g[a/x,A/X] \= 'f{x,X) implies a e A, 

then g{y) G A. 

Remark 2.6. In practice we will use an equivalent (less intuitive but often more convenient) 
rephrasing: 

Tl,g^ [LFPxMy 
iff 

for all A C dom{m), if g{y) ^ A, 
then there exists a € dom{DJt) such that a ^ A and Tl, g[a/x, A/ X] \= ip{x,X). 

2.3. Expressive Power. There is a recursive procedure, transforming any FO(LFP^)- 
formula if into a MSO-formula (p' such that 931,5 N iff 2Jt, 5 \= (p'. The interesting 
clause is {[LFPxx^{x, X)]yy = \/X{\/x{'p{x,X)' — )■ Xx) -> Xy). (The other ones are ah of 
the same type, e.g. ((/? A ip)' = {ip' A V'')-) This procedure can easily be seen adequate by 
considering the semantical clause for the LFP operator. 

Now there is also a recursive procedure transforming any FO(TC^) formula ip into 
a FO(LFP^) formula ip" such that 931,5 \= 531,(7 N v"- The interesting clause is 

{[rCxy^\{u, v))" = [LFPxyy = n V 3x{{Xx A ip{x, y)"))]v. Let us give an argument for this 
claim. By Proposition l2.4l it is enough to show that [LFPxyy = u\/3x{XxAip{x, y)")]v holds 
if and only if there is a finite ip" path from utov. For the right to left direction, suppose there 
is such a path ai . . . a„ with g{u) = oi and g{v) = an- Then, for any subset A of the domain, 
we can show by induction on i that if for all (1 < i < n), ai = u\/ 3x{{Ax A ip{x,ai)") 
implies G A, then v G A, i.e., [LFPxyy = uV 3x{{Xx A (p{x,y)"))]v holds. Now for the 
left to right direction, suppose there is no such (p" path. Consider the set A of all points 
that can be reached from u by a finite cp" path. By assumption, ^Av and it holds that 
V?/((y = uV 3x{Ax A (p{x, y)")) Ay), i.e., ^[LFPxyy = n V 3x{Xx A ip{x, y)")]v. 

It is known that on arbitrary structures FO(TC^) < FO(LFPi) < MSO (see [l3] or [21]) 
and on trees FO(TC^) <trees FO(LFPi) =trees MSO (see |ilj and [28])- It is also known 
that the (not FO definable) class of finite trees is already definable in FO(TC^) (see for 
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instance [22j), which is the weakest of the logics studied here. We provide additional detail 
in Section 16. 3[ 

3. The Axiomatizations 



h (j), whenever (p is a propositional tautology 
h Vx(^ — >■ (ff, where t is substitutable for x in (p 
\- \/x{(p ^ iIj) (\IXip yxTp) 
h — 7> Vx(^, where x does not occur free in (p 
\- X = X 

\- X = y ^ {if ip) , where if is atomic and ip is obtained 

from ip by replacing x in zero or more (but not necessarily 

all) places by y. 

if \- if and \- (p ip, then h ip 

if \- p, then h Vxcp 

Figure 1: Axioms and rules of FO 



COMP. h 3X\/x{Xx -f-)- p), where X does not occur free in (p 

MSOl. h yX(p — >■ (p[X/T], where T (which is either a set variable 

or a set predicate) is substitutable in ip for X. 

MS02. h yx{ip ^ V) ^ (vxv? ^ yxij) 

MS03. h 99 — 7- yXip, where X does not occur free in (p 



MSO Generalization if h 99, then h 



Figure 2: Axioms and inference rule of MSO 



FOl. 
F02. 
F03. 
F04. 
F05. 
F06. 



Modus Ponens 
FO Generalization 



FO(TCi) 

axiom 

FO(TCi) 

Genera- 
lization 



h [TCxyP>]iu,v) {{ip{u) A VxVy(V'(x) A p{x,y) 
where ip is any FO(TC^) formula 



if h ^ ^ ((P(u) A \fxyy{P{x) A Lp{x, y) 
and P does not occur in 
then l- ^ — )• [rCxy^]{u^v) 



P{y)))^P{v)), 



Figure 3: Axiom and inference rule of FO(TC^) 



As many arguments in this paper equally hold for MSO, FO(TC^) and FO(LFP^), we 
let A € {MSO, FO(TC^), FO(LFP^)} and use A as a symbol for any one of them. The 
axiomatization of A on finite trees consists of three parts: the axioms of first-order logic, 
the specific axioms of A, and the specific axioms on finite trees. 

To axiomatize FO, we adopt the infinite set of logical axioms and the two rules of 
inference given in Figure [1] (like in [H] , except from the fact that we use a generalization 
rule). Here, as in [H], by a propositional tautology, we mean a formula can be obtained 
from a valid propositional formula (also known as the sentential calculus) by uniformly 
substituting formulas for the proposition letters). Alternatively, FOl may be replaced by 
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FO(LFPi) h [LFPxMy ^ (Vx((^(x, V) ^ ^ V(y)) 

axiom where is any FO(LFP^) formula and 93(2;, -;/;) is the result 

of the replacement in '^{x, X) of each occurrence of X by V' 

(renaming variables when needed) 

FO(LFPi) if h ^ ^ (Vx((^(a;, P) P{x)) P{y)), 

Generalization and P positive in ip does not occur in ^, 

thenhC^jLFPxMiy) 

Figure 4: Axiom and inference rule of FO(LFP^) 



Tl. 


\/xyy\/z{x <yf\y<z^x<z) 


< is transitive 


T2. 


-i3x(x < x) 


< is irreflexive 


T3. 


VxVy(a; < y — > 3z(x <ch z f\z < y)) 


immediate child 


T4. 


3xVy(x < y) 


there is a unique root 


T5. 


VxVyVz(x <zAy<z^x<y\/y<x) 


linearly ordered branches 


T6. 


\lx\ly\lz{x <y/\y<z^x~<z) 


-< is transitive 


T7. 


Sx{x -< x) 


-< is irreflexive 


T8. 


VxVy(x ^ y — ?• 3z{x -<ns z r\z < y)) 


immediate next sibling 


T9. 


\lx3y{y ■< X A -^3z{z -< y)) 


there is a least sibling 


TIO. 


\lx^y{{x <y\J y ^x) ^ {3z{z <ch x Az <ch y) A x ^ y)) 


linearly ordered siblings 


Ind. 


Vx(Vy((x <yy X <y) ^ fiy)) ^ ^i^)) ~^ yx(p{x) 


induction scheme 


where 








ip{x) ranges over A-formulas in one free variable x, 




and 








X <ch y is shorthand for x < y A -<3z{z < y A x < z), 






X -<ns y is shorthand for x ^ y A -i3z(x -< z A z ^ y) 





Figure 5: Specific axioms on finite trees 



a complete set of axioms for propositional logic. To axiomatize MSO, the axioms and 
rule of Figure [2] are added to the axiomatization of FO and we call the resulting system 
'"MSG- COMP stands for "comprehension" by analogy with the comprehension axiom of set 
theory. MSOl plays a similar role as F02, MS02 as F03 and MS03 as F04. To axiomatize 
FO(TC^), the axiom and rule of Figure [3] are added to the axiomatization of FO and we 
call the resulting system I~fO(TC1)- axiomatize FO(LFP^), the axiom and rule of Figure 
m are added to the axiomatization of FO and we call the resulting system l-pQ(|_ppi). We 
are interested in axiomatizing A on the class of finite trees. For that purpose, we restrict 
the class of considered structures by adding to \-\ the axioms given in Figure [5] and we call 
the resulting system h^*^*^. Note that the induction scheme in Figure [5] allows to reason by 
induction on properties definable in A only. 
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Proposition 3.1. A finite structure SUt = {M, <, ~<, Pi, . . . , Pn) satisfies the axioms Tl- 
TIO if and only if ^ is a finite tree. 

Proof. It follows from the truth of Tl, T2 and T5 that (M, <) is a tree. Note that T3 and 
T4 are valid consequences of Tl, T2, T5 on finite structures. Furthermore, T6, T7 and 
TIO imply that -< linearly orders the children of each node (and that -< only relates to each 
other nodes that are siblings). Note again that T8 and T9 follow from T6, T7 and TIO on 
finite structures. □ 

In fact, as we will see later, cf. Theorem 16.71 the axioms Tl-TlO, together with the 
induction scheme Ind for A-formulas (where A € {MSO, FO(TC^), FO(LFP^)}) define the 
class of finite trees. 

We refer for basic definitions (e.g., proof by which we mean formal deduction, or axiom- 
atization by which we mean deductive calculus) to [13] and sometimes only sketch or even 
omit classical arguments. E.g., we assume the notion of being substitutable in a formula to 
be clear for both objects and set variables. For details on such basic notions and technics, 
we refer to the material extensively developed in [14] and in particular, to the proof of the 
FO completeness theorem presented there. The Henkin completeness proofs provided in 
Section H] are built on this classical material. 

We end this Section by spelling out some definitions that are specific to our paper. 

Definition 3.2. We say that a A-formula (p is A-provable if \-\ occurs (as the last line) 
in some A-proof and we say that it is A-consistent if its negation is not A-provable. 

Let r be a set of A-formulas and a A-formula. By T \-\ ip we will always mean that 
there are Vi) ■ ■ ■ j V'n € T such that \-\ {ipi A ... A ipn) V- Whenever F Ha ^^p does not 
hold, we say that ip is F-consistent. We say that F is A-consistent if T is F-consistent. 
Finally, we say that F is a maximal consistent set of A-formulas if F is consistent, and for 
each formula £ A, either G F or ^(p G F. 

Now the main result of this paper is that on standard structures, the A theory of finite 
trees is completely axiomatized by h^^^. In the remaining sections we will progressively 
build a proof of it. 

4. Henkin Completeness 

As it is well known, MSO, FO(TC^) and FO(LFP^) are highly undecidable on arbitrary 
standard structures and hence not recursively enumerable (by arbitrary, we mean when 
there is no restriction on the interpretation of the relation symbols from the signature, 
unlike in the case of, e.g., trees). So in order to show that our axiomatizations h^'^'^ are 
complete on finite trees, we refine a trick used by Kees Doets in his PhD thesis |12j . We 
proceed in two steps (the second step being the one inspired by Kees Doets). First, we 
show completeness theorems, based on a non-standard (so called Henkin) semantics for 
MSO, FO(TC^) and FO(LFP^) (on the general topic of Henkin semantics, see [IS], the 
original paper by Henkin and also [26]). Each semantics respectively extends the class of 
standard structures with non standard (Henkin) MSO, FO(TC^) and FO(LFPi)-structures. 
By the Henkin completeness theorems, our axiomatic systems h^*^*^ naturally turn out to be 
complete on the wider class of their Henkin-models. But we will see that compactness also 
follows from these completeness results and some of these Henkin models are infinite. As a 
second step, we show in Section [6] that no A-sentence can distinguish between standard and 
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non-standard A-Henkin-models among models of our axioms. Every finite Henkin model 
being also a standard model, this entails that our axioms are complete on the class of 
(standard) finite trees, i.e., each A-sentence valid on this class is provable using the system 

l_tree 
'^A • 

Now let us point out that Kees Doets was interested in complete axiomatizations of 
monadic "Ilj^-theories" of various classes of linear orders and trees. Considering such theories 
in fact amounts to considering first- order theories of such structures extended with finitely 
many unary predicates. Thus, he was relying on the FO completeness theorem and if he was 
working with non-standard models of particular FO-theories, he was not concerned with 
non standard Henkin-structures in our sense. In particular, he used Ehrenfeucht-Fraisse 
games in order to show that "definably well-founded" node-labeled trees have well-founded 
n-equivalents for all n. In Section 16.21 Lemma 16.41 which is the key lemma to our main 
completeness result, establishes a similar result for definably well-founded Henkin-models 
of the A-theory of finite node-labeled sibling-ordered finite trees. Hence, what makes the 
originality of the method developed in this paper is its use of Henkin semantics: we first 
create a Henkin model and then "massage" it in order to obtain a model that is among our 
intended ones. Similar methods are commonly used to show completeness results in modal 
logic, where "canonical models" are often transformed in order to obtain intended models 
(see p]). Remarkably, the completeness proof for the ^u-calculus on finite trees given in 
[8], which is directly inspired by the methods used here, proceeds in that way. There are 
numerous examples of that sort in modal logic (and especially, in temporal logic), but there 
is also one notable example in classical model theory. In 1970, Keisler provided a complete 
axiomatization of FO extended with the quantifier "there exist uncountably many" (see 
|20j). His completeness proof, which is established for standard models, is surprisingly 
simple, it relies on the construction of an elementary chain of Henkin structures and then 
uses the omitting types theorem. Hence all in all, these structures seem to provide a 
particularly convenient tool, not only for simple Henkin completeness proofs, but also for 
more refined completeness proofs with respect to interesting subclasses of Henkin models 
like standard models. 

Let us now introduce Henkin structures formally. Such structures are particular cases 
among structures called frames (note that such frames are unrelated to "Kripke frames") 
and it is convenient to define frames before defining Henkin-structures. In our case, a frame 
is simply a relational structure together with some subset of the powerset of its domain 
called its set of admissible subsets. A Henkin structure is a frame whose set of admissible 
subsets satisfies some natural closure conditions. 

Definition 4.1 (Frames). Let cr be a purely relational vocabulary. A a- frame 9Jt consists 
of a non-empty domain dom(9Jt), an interpretation in dom(9JT) of the predicates in a and a 
set of admissible subsets Agjt Q p{dom{dJVj) . 

Whenever = p{dom{^)), can be identified to a standard structure. Assignments 
g into 9Jt are defined as in standard semantics, except that if X is a set variable, then we 
require that g{X) £ Asjjj. 

Definition 4.2 (Interpretation of A- formulas in frames). A- formulas are interpreted in 
frames as in standard structures, except for the three following clauses. The set quantifier 
clause of MSO becomes: 

Tl,g\= 3Xip iff there is ^ G A<sx such that Tl,g[A/X] \= ip 
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The TC clause of FO(TC^) becomes: 

iff 

for all A € Agjt, if g{u) G A 
and for all a, 6 G dom(9H), a E A and 9Jt, ^([x/a, |= imply h ^ A, 

then (7(1;) G A. 
And finally the LFP clause of FO(LFP^) becomes: 

ajt,5 N [LFPxMv 
iff 

for all ^ G Agjt, if for all a G (iom(9Jt), 97t, 5f[a/x, vl/X] j= Lp{x,X) implies a G A, 

then G A. 

Definition 4.3 (A-Henkin-Structures). A A-Henkin-structure is a frame 5[R that is closed 
under parametric A-definability, i.e., for each A- formula (p and assignment g into 5[R: 

{a G M I S[R,5-[a/x] ^ (^} G 
We call a A-Henkin-structure SOT standard whenever every subset in dom(9Jt) belongs to 
Aon- 

Remark 4.4. Note that any finite A-Henkin-structure is a standard structure, as every 
subset of the domain is parametrically definable in a finite structure. Hence, non standard 
Henkin structures are always infinite. H 

Theorem 4.5. A is completely axiomatized on K- Henkin- structures by Ka, in fact for every 
set of A-formulas T and A-formula ip, (p is true in all A-Henkin-models of T if and only if 

r i-A ^p. 

We do not detail here the MSO proof, as it is a special case of the proof of completeness 
for the theory of types given in [26]. We focus only on the FO(LFP^) the FO(TCi) 

case is very similar, except that there is no need to consider set variables. Up to now we 
have been working with purely relational vocabularies. Here we will be using individual 
constants in the standard way, but only for the sake of readability (we could dispense with 
them and use FO variables instead). Also, whenever this is clear from the context, we 
will use h as shorthand for l~FO(LFPi)- begin the Henkin completeness proof for 

FO(LFP^). This will achieve the proof of Theorem 14. 5i 

Lemma 4.6 (Generalization Lemma for FO Quantifiers). IfT \~fo(LFP^) ^ ^ '^^^^ ™^ 
occur free in T, then T ^fo(LFP^) '^xip. 

Proof. We refer the reader to the proof for FO given by Enderton in [l^ page 117]. The 
same proof applies for FO(LFP^) (as well as MSO and FO(TC^)). □ 

Definition 4.7. We say that a set of FO(LFP^) formulas A contains FO(LFP^) Henkin 
witnesses if and only if the two following conditions hold. First, for every formula ip, if 
—Nxp G A, then -^ip[x/t\ G A for some term t and if ^[LFPxxP\y £ ^! then -^Py A 
-i3x(-iP3; A ip{P,x)) G A for some monadic predicate P. Second, if 99 G A and x is a free 
variable of ip, then \/x(Px o ip{x)) G A for some monadic predicate P. 

The originality of the FO(LFP^) case essentially lies in the notion of FO(LFP^)-Henkin 
witness of Definition 14.71 In order to use this notion in the proof of Lemma 14. 9t we also 
need the following lemma: 
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Lemma 4.8. Let T be a consistent set of FO(LFP^)-formulas and a FO(LFP^)- formula 
of the form \/x{}p •<->■ Px) with P a fresh monadic predicate (i.e. not appearing in V). Then 
r U {^} is also consistent. 

Proof. Suppose T\j\yx{ip Px)} is inconsistent, so there is some proof of _L from formulas 
in T\j\\/x{ip o Pa;)}. We first rename all bound variables in the proof with variables which 
had no occurrence in the proof or in Vx((/? -H- Px) (this is possible since proofs are finite 
objects and wc have a countable stock of variables). Also, whenever in the proof the 
FO(LFP^) generalization rule is applied on some unary predicate P, we make sure that this 
P is different from the unary predicate that we want to substitute by ip and which does 
not appear in the proof; this is always possible because we have a countable set of unary 
predicates. Now, wc replace in the proof all occurrences of Px by 99 (as we renamed bound 
variables, there is no accidental binding of variables by wrong quantifiers). Then, every 
occurrence of Vx((^ o Px) in the proof becomes an occurrence of \/x{(p (^), i.e., we have 
obtained a proof of ± from F U {Vx((/9 -(-)■ (^)}, i.e., from T {\/x{ip ^ (p) is provable, as it 
can be obtained by FO generalization from a propositional tautology). It entails that T is 
already inconsistent, which contradicts the consistency of T. Now it remains to show that 
the replacement procedure of all occurrences of Px by ip is correct, so that we still have a 
proof of _L after it. Every time the replacement occurs in an axiom (or its generalization, 
which is still an axiom as we defined it), then the result is still an instance of the given axiom 
schema (even for FO(LFP^) generalizations, because we took care that P is never used in 
the proof for a FO(LFP^) generalization). Also, as replacement is applied uniformly in the 
proof, every application of modus ponens stays correct: consider ip ^ £, and ip. Obviously 
the result ip* of the substitution will allow to derive the result ^* of the substitution from 
^* ^* and Ip*. Also _L* is simply _L, so the procedure gives us a proof of _L. □ 

Lemma 4.9. (FO(LFP^) Lindenbaum Lemma) Let a be a countable vocabulary and let 
a* = aU{cn \ G N} U {Pn I n G N} with Ci,Pi ^ a. If a set T of FO(LFP^)- formulas 
in vocabulary a is consistent, then there exists a maximal consistent set T* of a* formulas 
such that r C r* and T* contains FO(LFP^)-Henkin witnesses. 

Proof. Let F be a consistent set of well formed FO(LFP^)-formulas in a countable vocabulary 
a. We expand a to a* by adding countably many new constants and countably many new 
monadic predicates. Then F remains consistent as a set of well formed formulas in the 
new language. We fix an enumeration of all tuples consisting of two FO variables, one set 
variable and one formula of a*: 

< ipi,Xi,x[,Xi >, < ip2,X2,x'2,X2 >, < (p3,X3,x'^,X3 >,... 

(this is possible since the language is countable), where the ^pi are formulas, the 
FO variables and the Xi are set variables. 

• Let 03n-2 be -^Xn^Pn '^fn[xn/ci]j where q is the first of the new constants neither 
occurring in (pn nor in 9/. with k < 3n — 2. 

• Let 93n-i be ^[LFP^/^x„<fn]xn {-^PiXn A -.3x(-.P;x A (pn{Pi,x))), where P; is the first 
of the new monadic predicates neither occurring in pn nor in 6^ with A; < 3n — L 

• Let be yxn{<Pn PiXn), whcrc Pi is the first of the new monadic predicates neither 
occurring in pn nor in O/. with k < 3n. 

Call e the set of ah the Oi. 

Claim 4.10. F U 6 is consistent 
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If not, then because deductions are finite, for some m > 0, F U {6*1, . . . ,6m,0m+i} is 
inconsistent. Take the least such m. Then, by the definition of consistency and the axioms 
of prepositional logic, F U {^i, . . . , 6m} I — ^^m+i- Now there are three cases: 

(1) Om+i is of the form -Nxip — >■ -i(/j[x/c], so both T U {^i,...,^^} I~ -Nx(p and T U 
{6*1, . . . ,6m} \~ ^p{x/c\. Since c does not appear in any formula on the left, by Lemma 
14.61 r U {^1, . . . , 6m} l~ Vxc^, which contradicts the minimality of m (or the consistency 
of r if m = 0). 

(2) 9m+i is of the form -^[LFPxx^]y {~^Pv A -'^x{-^Px A ip{P,x))). In such a case both 
Tu{6i...6m}\- ^[LFPxxf]y and r U {6*1 . . . 6*^} h ^{^Py A ^3x{^Px A (p{P, x))) hold. 
It follows that r U {6*1 . . . 6m} l~ \/x{(p{P,x) P{x)) Py- Since P does not appear 
in any formula on the left, by FO(LFP^) generalization, T U {^i . . . 6m} l~ [LFPxx^]y, 
which contradicts the minimality of m (or the consistency of T whenever m = 0). 

(3) 6m+i is of the form Vx((/3 o Px). By Lemma l48l this is not possible. 

We extend L U to a maximal consistent set F* in the standard way (see for instance |14t 
page 137]). □ 

We will now show that if F* is a maximal consistent set that contains FO(LFP^)-Henkin 
witnesses, then F* has a FO(LFP^)-Henkin model 9Jtr*- 

Definition 4.11. Let F* C FORM{a) be maximal consistent and contain FO(LFP^)- 
Henkin witnesses. We define an equivalence relation on the set of FO terms, by letting 
ti =r* t2 iff ti = t2 £ T* . We denote the equivalence class of a term t by 

Proposition 4.12. =r* is an equivalence relation. 

Proof. By F05 and F06. □ 

Definition 4.13. We define 9Jlr* (together with a valuation gr*) out of F*. 

• M = {\t\ : t is a FO term } 

• Agjt^, = {At : T is a set variable or a monadic predicate} where At = {\t\ : Tt G T*} 

. i\h\,...,\tn\)eP^}iSPh...tneT* 

• gr*{x) = \x\ 

• g^, {X) = Ax 

Proposition 4.14. 9Jtr* is a FO(LFP^ )-Henkin structure. 

Proof. By construction of F* which contains FO(LFP^)-Henkin witnesses, this is immediate 
(we introduced a monadic predicate for each parametrically definable subset). □ 

Lemma 4.15. (Truth lemma) For every FO(LFP^) formula (p, Wlr*,gr* |= iff ip £T*. 

Proof. By induction on ip. 

The base case follows from the definition of SUtr* together with the maximality of F*. 
Now consider the inductive step: 

• Boolean connectives and FO quantifier: exactly as in FO (see [14' page 138]), basically, 
for the FO quantifier step we rely on the fact that F* contains FO(LFP^) Henkin witnesses 
and we use the ^3n+2 formulas introduced in the proof of Lemma 14.91 

• LFP operator: we want to show that 

aHr.,5r* N [LFPxxV]y iff [LFPxx^]y G F* 

— We first show that 
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ajtr.,9r. N [LFPxxip]y implies [LFPxx^]y S T*. 
Assume Tlr* , gr* \= [LFPxx^]y, i-e., for all monadic predicate in a* or set variable T, if 
gr*iy) ^ then there exists \t\ € M, such that |t| ^ At and dytr*,gr*[x/\t\,X/AT] \= 
if. It follows by induction hypothesis that for every such T, if Ty ^ T*, then there 
exists a term t such that Tt F* and (p{t,T) E F*. By maximal consistency of F* and 
using the contraposition of the F02 axiom, it follows that for all monadic predicate in 
(T* or set variable T such that Ty ^ T* , it holds that 3x{-^Tx A (p{x,T)) G F*. Now 
suppose [LFPxxV]y ^ T*. By maximal consistency of F*, we get ^[LFPxx^]y G T*. 
Then as F* contains FO(LFP^) Henkin witnesses, there is a predicate T such that for 
some n, Osn-i G T* is of the form ^[LFPxx^]y i~^Fy A -i3x(-iTx A ip{T,x))). By 
maximal consistency of F*, it follows that ^Ty A -^3x{-^Tx A f{T,x)) E F*. Hence 
there is a predicate T such that Ty F* and -'3x{-^Tx A ip{x,T)) G F*. But that 
contradicts the consistency of F*, as we previously showed that whenever Ty F*, then 
also 3x{^Tx A (p{x, T)) G F*. Then ^[LFPxxf]y T* and by maximal consistency of 

F*, [LFPxxv]y G r*. 

— We now show that [LFPxxV]y £ T* implies Tlr*,gr* \= [LFPxx^]y- We consider the 
contraposition 

9JTr.,5r* ^ [LFPxx^]y implies [LFFxa^V^ly T*. 
Assume 9JTr*,(7r* [FFPxxH^]y- So SOTr*,^?* |= "'[-^^^-Pxi'V']?/ and there exists a 
monadic predicate in a* or a set variable T such that At G Agjtp, , ^(y) ^ and for 
all \t\ £ M, |t| G At or 2Jtr* , 5r* [aj/l^l, -'^/r] ^ ^v?. By induction hypothesis Ty F* 
and for all term t, either Tt G F*, or ^Lp{t,T) G F*. By maximal consistency of F*, 
for all term t, Tt V ^(p{t,T) G F*. Now assume ^Vx(T3; V -^ip{x,T)) G F*. As F* 
contains Henkin witnesses, there is some n and some term t such that ^3n-2 £ T* is of 
the form -'VxiTx V ^(^(x, T)) ^ (Tt V ^99(t, T)) and hence Tt V ^ip{t, T) G F*, which 
contradicts the maximal consistency of F* . Hence Vx(r3;V-i(^(3;, T)) G F*. By maximal 
consistency of F*, -^TyA\/x{Tx\/^ip{T, x)) G F* and so also ^TyA^3x{^TxA(p{T, x)) G 
F*. Now suppose [LFPxxf]y G F*. Then by the LFP axiom, for every monadic 
predicate in a* or set variable T, we get that -^Ty — > 3x{-^T{x) A ip{x,T)) G F* and 
so ^{-^Ty A -i3x(-iTx A ip(T,x))) G F*. But that contradicts the maximal consistency 
of F*. 

□ 

Theorem 4.16. Every consistent set F of FO(LFP^)-formulas is satisfiable in a FO(LFP^)- 
Henkin model. 

Proof. First turn F into a FO(LFP^) maximal consistent set F* with FO(LFP^)-Henkin 
witnesses in a possibly richer signature (with extra individual constants and monadic pred- 
icates) a*. Then build a structure Tlr* out of this F*. Then the structure Tlr* satisfies F* 
under the valuation g^* and hence it satisfies also F (F being a subset of F*). □ 

Compactness follows directly from Definition 13.21 and Theorem 14.51 i-^-j a possibly 
infinite set of A-sentences has a A-Henkin model if and only if every finite subset of it has 
a A-Henkin model. It also follows directly from Theorem 14.51 that h^*^*^ is complete on the 
class of its A-Henkin-models. Nevertheless, by compactness the axioms of h^*^^ also have 
infinite models. We overcome this problem by defining a slightly larger class of Henkin 
structures, which we will call definahly well-founded K-quasi-treesa 



For a nice picture of a quasi-tree that is not definably well-founded, see [2]. 
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Definition 4.17. A A-quasi-tree is any A-Henkin structure 

(T,<,^,Pi,...,P„,Ar) 

(where Aj- is the set of admissible subsets of T) satisfying the axioms Tl-TlO of Figure [5j 
A A-quasi-tree is definably well founded if, in addition, it satisfies ah A-instances of the 
induction scheme Ind of Figure [5j 

With this definition, we obtain from Theorem 14.51 the following: 

Corollary 4.18. A set of A-formulas is consistent if and only if it is satisfiable in a 
definably well-founded A-quasi-tree. 

5. Operations on Henkin-Structures 

Let A G {MSO, FO(TC^), FO(LFP^)}. As noted in Remark|131 every finite A-Henkin struc- 
ture is also a standard structure. Hence, when working in finite model theory, it is enough 
to rely on the usual FO constructions to define operations on structures. On the other 
hand, even though our main completeness result concerns finite trees, inside the proof we 
need to consider infinite (A-Henkin) structures and operations on them. In this context, 
methods for forming new structures out of existing ones have to be redefined carefully. 
We first propose a notion of substructure of a A-Henkin-structure generated by one of its 
parametrically definable admissible subsets: 

Definition 5.1 (A-substructure). Let SOT = {dom{dyV),Pred,A<jji) be a A-Henkin-structure 
(where Pred is the interpretation of the predicates). We call Tlfo = {dom{Tl), Pred) the 
relational structure underlying SOT. Given a parametrically definable set A G Agjt, the A- 
substructure of VXt generated by A is the structure Tl \ A = {{A)^^^, Atj)x\A), where (A)^^^ 
is the relational substructure of SOZfo generated by A (note that A forms the domain of 
{A)<)}ifQ, as the vocabulary is purely relational) and Aruttyi = {X n A\X G Agji}. 

Note that in the case of MSO and FO(LFP^), we could also have defined A^fA in an 
alternative way: 

Proposition 5.2. Take and A as previously and consider the structure (971 \ A)' = 
((^)otpq, Aj-gjj^^)/), where A(OTfA)' = {X ^ A\X G A^}. Whenever Tl is a MSO-Henkin 
structure or a FO(LFP^)-Henkin structure, Tl \ A and {Tl \ A)' are one and the same 
structure. 

Proof. Indeed, take B G Aggj^^. So there exists B' G A^ such that B = B' n A. We want to 
show that also B' CiA £ A(OT|^^y i.e. B' n A C A (which obviously holds) and B' HA £ A^. 
The second condition holds because both B' and A are parametrically definable in 9Jt, so 
their intersection also is (B' CiA = {x \Tl\= AxAB'x}). Conversely, consider B G A^^rjji^j^y. 
As B <^ A and B G Agjt it follows that B G A<xhia (we can take B = BnA). □ 
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Now, in order to show that A-substructures are A-Henkin-structures, we introduce a 
notion of relativization and a corresponding relativization lemma. This lemma estabhshes 
that for every A-Henkin-structure 9Jt and A-substructure 9Jt \ A oi dJt (with A a set para- 
metricahy definable in 9Jl), if a set is parametrically definable in 9Jt f j4 then it is also 
parametrically definable in 9JT. This result will be useful again in Section 16.21 

Definition 5.3 (Relativization mapping). Given two A- formulas (/?, ip having no variables in 
common and given a FO variable x occurring free in tp, we define REL{(p, ip, x) by induction 
on the complexity of (p and call it the relativization of if to tp: 

• If 99 is an atom, REL{ip,'ip,x) = 99, 

• If 99 991 A ip2, REL{ip, Tp, x) = REL{(pi,ip, x) A REL{ip2,Tp, x) (similar for V, -■), 

• liip:^ 3yX, REL{ip, ip, x) = 3y{ip[y/x] A REL{x, V', x)), 

• If V9 :« 3Yx, REL{^, tP, x) = 3Y{yx(Yx ^ ^) A REL{x, V', x)), 

• If V9 :Ri [TCyzX]{u,v), 

REL{ip,iP,x) = [TCy,REL{x,ip,x) AiPiy/x] A7p[z/x]]{u,v), 

• If^:^ [LFPxyX]z, REL{^,^,x) = [LFPxyX ^^[ylx]\z. 

where ip[y/x\ is the formula obtained by replacing in ^/^ every occurrence of x by y and 
similarly for ip[z/x\. 

Hence for instance, REL{3yP{y),Q{x),x) = 3y{P{y) /\Q{y)), which is satisfied in any 
model of which the submodel induced by Q contains an element satisfying P. 

Lemma 5.4 (Relativization lemma). Let ^ be a A-Henkin-structure, g a valuation on dJt, 
ip, ip K- formulas having no variable in common and A = {x \ 9Jt, (7 \= %p}. If g{y) € A for 
every variable y occurring free in 99 and g{Y) G ^m\A for every set variable Y occurring 
free in ip, then 9Jl, g \= REL{ip, ip, x) <^ Tl \ A, g \= ip. 

Proof. By induction on the complexity of ip. Let g be an assignment satisfying the required 
conditions. Base case: p is an atom and REL{p,ip,x) = p. So Tl,g \=p<^Tl\A, g\=p> 
(by hypothesis, g is a suitable assignment for both models). Inductive hypothesis: the 
property holds for every p of complexity at most n. Now consider p of complexity n + 1. 

• (/9 :« (/9i A (/92 and REL{pi Ap2,'4'ix) :~ REL{pi,ip, x) A REL{p2,^p, x). By induction 
hypothesis, the property holds for pi and for p2- By the semantics of A, it also holds for 
Pi A p2- (Similar for V, — -■.) 

• p -.^ 3yx and REL{3yx) :~ 3y{ip[y/x\ A REL(xt'4'ix)). By inductive hypothesis, for 
every node a G A, Tl,g[a/y] \= REL{x,ip,x) ^ \ A,g[a/y] \= x- Hence, by the 
semantics of 3 and by definition of A, Tl,g \= 3y{jp[y/x] A REL{x,tp, x)) 4^ Tl \ A, g \= 

• (/9 :~ 3Yx and REL(3Yx, tp, x) = 3Y{yx(Yx ip) A REL(xi i^i x)). As every admissible 
subset of 9Jl f A is also admissible in 9K (by Proposition 15. 2p it follows by inductive 
hypothesis that for every B G A^^a with B C A, d)l,g[B/Y] \= REL{x,'ip,x) ^ Tl \ 
A, g[B/Y] \= X- Hence, by the semantics of 3 and by definition of A, g \= 3Y(yx(Yx — > 
V^) A REL{x, i^,x))^Tl\ A,g^ 3Yx. 

• p :« [TCy,x]{u,v) and REL{[TCy,x]{u,v),i^,x) = [TCy,REL{x,i^,x) A tP[y/x] A 
ip[z/x]]{u,v). By definition of TC, the following are equivalent: 

L m\ A,g^[TCy,x]iu,v), 

2. for all B G Asjjjf^, if g{u) G B and for all a,b e A, a ^ B and Tl \ A,g[a/y,b/z] \= x 
implies b € B, then g{v) G B. 
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By inductive hypothesis, for all a,b G A, 

m, g[a/y, b/z] ^ REL{x, ^,x)<^m\A, g[a/y, b/z] ^ x- Hence 2. 4^ 3.: 

3. for all B S ^m\A-, if g{u) G B and for all a, 6 € j4, a € i? and 9Jt, (^[a/y, ft/z] |= 
REL{xjipi x) implies b G B, then g{v) € B, 

By definition of A, 3. -<4> 4.: 

4. for all € A^i^^, if g((n) € B and for all a,b (iom(9Jt), a & B and 9Jt, g[a/y, b/z] \= 
REL{xj V') 2;) A V'ly/^;] implies b G B, then ) E i?, 

We claim that 4. 5.: 

5. for all C E Agjj, if (^(ti) G C and for all a,b £ dom{Tt), a G C and Tl, g[a/y,b/z] \= 
REL{xj ipi x) A iply/x] A tp[z/x\ implies b G C, then g{v) G C, 

which, by the semantics of TC, is equivalent to: 

6. h [TCy,REL{x,^,x) Aij[y/x] AiP[z/x]]{u,v). 

It is clear that 5. ^4.. For the 4. =^ 5. direction, assume 4.. Take any set C G Aqj; such 
that g{u) G C and for all a,b G dom(Tl), a G C and DJl, g[a/y,b/z] \= REL{x,ip, x) A 
iply/x] A tp[z/x\ implies b e C. Let 5 = ^ n C. By Definition \5l\ B G Agjt^^. Now 
by our assumptions on g and by definition of A, g[a/y,b/z] only assigns points in A. 
So as B = AnC, g{u) G B and for all a,6 G dom{Tl), a e B and Tl,g[a/y,b/z] \= 
REL{x, ip, x) A ipiy/x] A iplz/x] implies b e B. So by 4., g{v) e B. As 5 C C, it follows 
that g{v) G C. 

• 99 and REL{[LFPxyX\z-,'^,x) -.^ [LFPxyX ^ ''Piu By definition of 

LFP, the following are equivalent: 

1. m\A,g^ [LFPxyX]z, 

2. for all B G AgjtfA, if for all a G ^, f A,g[a/y,B/X] \= x implies a e B, then 
giz) G B. 

By inductive hypothesis, for all a £ A, B £ Tt \ A, M,g[a/y,B/X] \= REL{x,ip,x) 4^ 
Tl \ A,g[a/y,B/X] \= x- Hence 2. is equivalent to 3.: 

3. for all B G Agji^yi, if for all a G A, yR,g[a/y,B / X] \= REL^x,"^, x) implies a £ B, 
then g{z) G B, 

By definition of A, 3. -<4> 4.: 

4. for all S G Aot^A; if for all a G (iom(5!Jt), 
9Jt,5[a/y,S/X] ^ REL{x,il',x) Ai{;[y/x] implies a £ B, 
then g{z) G 5, 

We claim that 4. <^4> 5.: 

5. for ah C G A^, if for ah a G (iom(SUt), 

971, g[a/y, C/X] \= REL{x, V', x) A ^[y/x] implies o G C, 

then g{z) G C, 
which, by the semantics of LFP, is equivalent to: 

Q.Tl,g^ [LFPxyREL{x,^,x) A^[y/x]]z. 
It is clear that 5. =^ 4.. For the 4. =^ 5. direction, assume 4.. Take any set C G Agjj 
such that for all a G (iom(9Jt), 9Jt, ^[0/2/, C/X] ^ REL{x-,'^-,x) A iply/x] implies a G C. 
Let B = A n C. By Definition 15.11 B G Agjj^^. Consider a G dom(9JT) such that 
9Jt, ^[a/y, S/X] \= REL{xtiPtx) Aip[y/x]. As REL{x-,'4'-,x) is positive in X and X does 
not occur in ^, 5Ut, 5'[a/y, C/X] |= i?£'L(x, V'l 2;) A V'ly/a;]- Also by hypothesis a G C. Now 
as SOT, ^[a/y] |= ip[y/x], by definition of ^, a G A. So a G A n C, i.e, a € B and since we 
proved it for arbitrary a G dom{W), by 4., 5(2) G i?. As i? C C, it follows that g{z) G C. 

□ 
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Theorem 5.5. Let 9Jt and A be as in Definition \5.1\ Then \ A is a A-Henkin-siractitre. 

Proof. Take B parametrically definable in 9Jt f A, i.e., there is a A- formula ip{y) and an 
assignment g such that B = {a ^ dom{Tl \ A) \ \ A, g[a/y] \= f{y)}- Now we know that 
A is also parametrically definable in 9K, i.e., there is a A-formula il^^x) and an assignment g' 
such that A = {a € dom{Tl) \ d}l,g'[a/x] \= ^l'{x)}. Assume without loss of generality that ip 
and have no variables in common. We define an assignment g* by letting g*{z) = g'{z) for 
every variable z occurring in ip and g*{z) = g{z) otherwise. The situation with set variables 
is symmetric. Now by Lemma |5.4| i? = {a € dom(9Jl) | g*[a/x] \= REL{ip,'ip,x)} and 
hence B G Agjt- By definition 15.11 it follows that B € ^<xiliA (because B = B f] A). □ 

There is, in model theory, a whole range of methods to form new structures out of 
existing ones. Standard references on the matter are |15l I25| . written in a very general 
algebraic setting. Familiar constructions like disjoint unions of relational structures are 
redefined as particular cases of a new notion of generalized product of FO-structures and 
abstract properties of such products are studied. In particular, an important theorem now 
called the Feferman-Vaught theorem for FO is proven in [15] . We are particularly interested 
in one of its corollaries, which establishes that generalized products of relational structures 
preserve elementary equivalence. We show an analogue of this result for a particular case 
of generalized product of A-Henkin-structures that we call fusion, this notion being itself a 
generalization of a notion of disjoint union of A-Henkin-structures defined below. 

Definition 5.6 (Disjoint union of A-Henkin-structures). Let cr be a purely relational vo- 
cabulary and a* = a U {Qi, . . . , Qk}, with {Qi, . . . , Qk} a set of new monadic predicates. 
For any A-Henkin-structures 9Jti , . . . , in vocabulary a with disjoint domains, define 
their disjoint union 1+Ji<j<fc (or, direct sum) to be the a*-hame that has as its domain 
the union of the domains of the structures and likewise for the relations, except for the 
predicates Qi, whose interpretations are respectively defined as the domain of the structures 
Tli (we will use Qi to label the elements of Mj). The set of admissible subsets -'^y^^.^j. saii 
is the closure under finite union of the union of the sets of admissible subsets of the 9Jtj. 
That is: 

• dom(yi<i<fc5Jti) = Ui<»<fc^^om(93Ti) 

. pWi<,<fc9Jt, ^ y^^^^^,p2n, (^ith Pea) and = dom{mi) 

• A£ Ay^^^^^g^t, iff ^ = Ui<i<fc for some Ai £ A^^ 

Definition 5.7 (/-fusion of A-Henkin-structures). Let a be a purely relational vocabulary 
and a* = a U {Qi, ■ ■ ■ , Qk}, with {Qi, . . . , Qk} a set of new monadic predicates. Let / be 
a function mapping each n-ary predicate P G a to a quantifier-free first-order formula over 
a* in variables xi, . . . , x„. For any A-Henkin-structures 9Jti, . . . , 9Jtfc in vocabulary a with 
disjoint domains, define their f -fusion to be the cr-frame ©f<j<fcS[)Tj that has the same 
domain and set of admissible subsets as l±Ji<j<fc2^i- For every P £ a, the interpretation of 
P in ©f<j<,fc is the set of n-tuples satisfying f{P) in \Si<i<k'^i- 

An easy example of /-fusion on standard structures (it is simpler to give an example on 
standard structures, as we do not have to say anything about admissible sets) is the ordered 
sum of two linear orders (Mi, <i), (M2, <2), where all the elements of Mi are before the 
elements of M2. In this case, a consists of a single binary relation <, the elements of Mi are 
indexed with Qi, those of M2 with Q2 and / maps < to xi < X2 V (QiXi A Q2X2)- Another 



18 



A. GHEERBRANT AND B. TEN GATE 



notable example of /-fusion is the o U {Qi; • • • ; Qfc}-structure l+li<j<fc = ©i<j<fc ^ 
where / is the identity function and for each 1 < i < k, dJlf is the expansion of the a- 

structure 9Kj in which ' = dom{Wli) and ' =0 for every i ^ j. In this sense, disjoint 
union as we defined it above can be seen as a special case of fusion. 

We show preservation results involving /-fusions of A-Henkin-structures. Hence we deal 
with analogues of elementary equivalence for these logics and we refer to K- equivalence. Let 
us recall that by quantifier depth of a A-formula, we mean the maximal number of nested 
quantifiers in the formula (by "quantifier", we mean FO and MSO-quantifiers, as well as 
TC or LFP-operators). 

Definition 5.8. Given two A-Henkin-structures 9Jf and 91, we write 971 =a 91 and say 
that 9Jt and 91 are A-equivalent if they satisfy the same A-sentences. Also, for any natural 
number n, we write SOT =^ 91 and say that 9Jt and 91 are n- A- equivalent if SOT and 91 satisfy 
the same A-sentences of quantifier depth at most n. In particular, 9Jt =a 91 holds iff, for all 
n,m=im holds. 

Now we are ready to introduce the "Feferman-Vaught theorems" that we will show in 
Section [52] and which establish that /-fusions of A-Henkin-structures preserve A-equivalence, 
that is: 

Theorem 5.9. Let Tli, . . . , Tlk, 9Ii, . . . , 9lfc be A-Henkin structures. Whenever Tli =^ 9ti 
for alll<i<k, then also ©(<,<fc 9)t, =1 ©(<i<fc9Jti. 

We will also show in this section that every /-fusion of A-Henkin-structures is a A- 
Henkin-structure. Comparable work had already been done by Makowski in [25j for exten- 
sions of FO, but an important difference is that he only considered standard structures, 
whereas we need to deal with A-Henkin-structures. Our proofs make use of Ehrenfeucht- 
Prai'sse games for each of the logics A. 

5.1. Ehrenfeucht-Praisse Games on Henkin-Structures. 

Let A G {MSO, FO(TC^), FO(LFPi)}. We survey Ehrenfeucht-Fraisse games for FO, MSO, 
FO(TC^), and FO(LFP^) which are suitable to use on Henkin structures. We also provide 
an adequacy proof for the FO(TC^) game. The MSO game is a rather straightforward 
extension of the FO case and has already been used by other authors (see for instance 
|23j). The FO(LFP^) game is borrowed from Uwe Bosse [4]. It also applies to Henkin 
structures, as careful inspection shows. The FO(TC^) game has already been mentioned in 
passing by Erich Gradel in |17j as an alternative to the game he used and we show that it 
is adequate for Henkin semantics. It looks also similar to a system of partial isomorphisms 
given in [6j. However it is important to note that this game is very different from the 
FO(TC^) game which is actually used in [17]. The two games are equivalent when played 
on standard structures, but not when played on FO(TC^)-Henkin structures. This is so 
because the game used in [6] relies on the alternative semantics for the TC operator given 
in Proposition 12.41 so that only finite sets of points can be chosen by players ; whereas 
the game we use involves choices of not necessarily finite admissible subsets. These are 
not equivalent approaches. Indeed, on FO(TC^)-Henkin structures a simple compactness 
argument shows that the semantical clause of Proposition 12.41 (defined in terms of existence 
of a finite path) is not adequate. 
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Let us first introduce basic notions connected to these games. One rather trivial suf- 
ficient condition for A-equivalence is the existence of an isomorphism. Clearly isomor- 
phic structures satisfy the same A- formulas. A more interesting sufficient condition for 
A-equivalence is that of Duplicator having a winning strategy in all A Ehrenfeucht-Frai'sse 
games of finite length. To define this, we first need this notion: 

Definition 5.10 (Finite Partial Isomorphism). A finite partial isomorphism between struc- 
tures dJl and ^ is a finite relation {(oi, 61), . . . , (a„, 6„)} between the domains of Tl and ^ 
such that for all atomic formulas ip{xi, . . . , Tl \= ip [ai, . . . , an] iff *Jl |= [61 6„] . 
Since equality statements are atomic formulas, every finite partial isomorphism is (the graph 
of) a injective partial function. 

We will also need the following lemma: 

Lemma 5.11 (Finiteness Lemma). Fix any set xi, . . . X^+i, . . . ,Xm- In a finite rela- 
tional vocabulary, up to logical equivalence, with these free variables, there are only finitely 
many A-formulas of quantifier depth < n. 

Proof. This can be shown by induction on k. In a finite relational vocabulary, with finitely 
many free variables, there are only finitely many atomic formulas. Now, any A-formula of 
quantifier depth A: + 1 is equivalent to a Boolean combination of atoms and formulas of 
quantifier depth k prefixed by a quantifier. Applying a quantifier to equivalent formulas 
preserves equivalence and the Boolean closure of a finite set of formulas remains finite, up 
to logical equivalence. □ 

Now, as we are concerned with extensions of FO, every A-game will be defined as an 
extension of the classical FO game, that we recall here: 

Definition 5.12 (FO Ehrenfeucht-Frai'sse Game). The FO Ehrenfeucht-Frai'sse game of 
length n on standard structures 9JT and 91 (notation: EFpq{TI,^)) is as follows. There 
are two players, Spoiler and Duplicator. The game has n rounds, each of which consists 
of a move of Spoiler followed by a move of Duplicator. Spoiler's moves consist of picking 
an element from one of the two structures, and Duplicator's responses consist of picking an 
element in the other structure. In this way, Spoiler and Duplicator build up a finite binary 
relation between the domains of the two structures: initially, the relation is empty; each 
round, it is extended with another pair. The winning conditions are as follows: if at some 
point of the game the constructed binary relation is not a finite partial isomorphism, then 
Spoiler wins immediately. If after each round the relation is a finite partial isomorphism, 
then the game is won by Duplicator. 

Theorem 5.13 (FO Adequacy). Assume a finite relational first- order language. Duplicator 
has a winning strategy in the game EFpQ{dyt,^) iff^ =fo particular. Duplicator 

has a winning strategy in all EF-games of finite length between 5Jt and 91 if and only if 
Tl =FO 

The proof for the first order case is classic. We refer the reader to the proof given in 
[I3] or to the one in [2l] . 

For technical convenience in the course of inductive proofs, we extend the notion of 
FO parameter by considering set parameters, i.e., instead of interpreting a set variable as a 
name of the admissible set A, we can add a new monadic predicate A to the signature. The 
new predicates and the sets they name are called set parameters. (This is similar to the FO 
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notion that can be found in [H].) We will work with parametrized (or expanded) Henkin- 
structures, that is, structures considered together with partial valuations. This means that 
the assignment is possibly non empty at the beginning of the game, which can start with 
some "handicap" for Duplicator, i.e., some preliminary set of already "distinguished objects 
and sets". 

We first define a necessary and sufficient condition for MSO equivalence by extending 
Ehrenfeucht-Fraisse games from FO to MSO. This game has already been defined in the 
literature, see for instance [23J. 

Definition 5.14 (MSO Ehrenfeucht-Fraisse Game). Consider two MSO-Henkin structures 
Tl together with A G A^j^, a G dom{TtY and together with B G A^^, b G dom{^Y and 
r > 0, s > 0, n > 0. The MSO Ehrenfeucht-Fraisse game EF^QQ({m,A,a),{m,B,b)) of 
length n on expanded structures (9Jt, A, a) and (9T, B, b) is defined as for the first-order 
case, except that each time she chooses a structure. Spoiler can choose either an element 
or an admissible subset of its domain. For a given Ar+i G A<x)i chosen by Spoiler, {Tl, A, a) 
is expanded to (9Jt, ^, A^+i, a). Duplicator then responds by choosing -B^+i S A<n and 
(9T, B, b) is expanded to (9T, B, Br+i,b). The game goes on with the so expanded structures. 
The winning conditions are as follows: if at some point of the game a i— )• 6 is not a finite 
partial isomorphism from (9Jt, A, A^+i) to {'Ol,B,Br-\-i), then Spoiler wins immediately. If 
after each round the relation is a finite partial isomorphism, then the game is won by 
Duplicator. 

Theorem 5.15 (MSO Adequacy). Assume a finite relational MSO language. Given 9Jl and 
A G Ajjjj, B G A^, a G dom{Tiy , b G (iom(9T)* and r > 0, s > 0, n > 0, Duplicator has a 
winning strategy in the game i?F^gQ((9JT, A, a), (9T, B, b)) iff (SPt, A, a) and (OT, B, b) satisfy 
the same MSO formulas of quantifier depth n. In particular, Duplicator has a winning 
strategy in all EFMso-gaTn^s of finite length between (9H, A, a) and (OT, B, b) if and only if 
(9Jt, A, a) and (9T, B, b) satisfy the same MSO formulas. 

We omit the proof, because it parallels the FO case. The proof works regardless whether 
MSO is interpreted in the standard or in the Henkin way. What matters here is that the 
game-theoretic meaning of a "quantification" over a given "domain" , lies in the choice of an 
element from that domain (including one consisting of "higher-order elements", e.g., sets). 

Corollary 5.16. For MSO-Henkin-structures dJt, 91 and n > 0, Duplicator has a winning 
strategy in EFf^gQ{d}l, 91) if and only ifTl =^sq 91. In particular, Duplicator has a winning 
strategy in all EFMSQ-games of finite length between dJt and 91 if and only if^ =MSO 

The FO(TC^) game that we will be introducing now had been already mentioned in 
passing by Erich Gradel in [17] as an alternative to the game he used. We will show that it 
is adequate on Henkin-structures. 

Definition 5.17 (FO(TCi) Ehrenfeucht-Fr aisse Game). Consider two FO(TC^)-Henkin 

structures and 91 together with a G dom{^Y , b G dom(9t)* and s > 0, n > 0. The 
FO(TC^)-game EF^Q^jQi^{{Tl,a), {^,b)) of length n on expanded structures (9H, a) and 

(91, b) is defined as for the first-order case, except that each time she chooses a structure, 
Spoiler can either choose only one element or an admissible subset together with two ele- 
ments of its domain. In the first case we say that she plays an 3 (or point) move and in 
the second case, a TC-move (which we will define more precisely below). Each point move 
results in an extension of the assignment {a i-^ 5} with elements a^+i G (iom(9Jt), G 
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(iom(OT). Each TC-move results in an extension of the assignment {a b} with elements 
as+i,as+2 G dom{Tl),bs^i,bs+2 G dom{yt). At each round, Spoiler chooses the kind of 
move to be played. 

The 3 move is defined as in the FO case. The TC-move is as follows: 
Spoiler considers two pebbles {ai,bi) and {aj,bj) on the board (i.e., corresponding cou- 
ples of parameters taken in each structure) and depending on the structure that he chooses 
to consider, he plays: 

• either a set A € Agjj with Ui & A and Uj ^ A. Duplicator then answers with a set B € Af^ 
such that bi G B and bj ^ B. Spoiler now picks 5^+1 £ B,bs+2 ^ B and Duplicator 
answers with a^+i € A, as+2 ^ A. 

• or a set B € Ar^ with bi (z B and bj ^ B. Duplicator then answers with a set A E Agji 
such that tti € A and aj ^ A. Spoiler now picks a^+i G A, 0^+2 ^ A and Duplicator 
answers with bs-^-i G B, 65+2 ^ B. 

In each TC-move, the assignment is extended with a^+i 1— > 6^+1,0^5+2 ^ 6^+2- After n 
moves. Duplicator has won if the constructed assignment a 1— >• 6 is a partial isomorphism 
(i.e. the game continues with the two new pebbles in each structure, but the sets A and B 
are forgotten). 

Theorem 5.18 (FO(TCi) Adequacy). A ssume a finite relational FO(TC^) language. Given 
two FO(TC^ )-Henkin structures 9Jl and a € dom{WY , b € dom(9T)* and r > 0, s > 0, 
n > 0, Spoiler has a winning strategy in the game £'TPq^.^^i^((9JI, a), (OT, 6)) iff there is a 

FO(TC^) formula of quantifier depth n distinguishing (9Jt, a) and (9T, b) . 
Proof. 

From the existence of a winning strategy for Spoiler in the FO(TC^)-game of length n in 
between (9Jl, a) and (9T, b), we will infer the existence of a FO(TC^)-formula of quantifier 
depth n distinguishing (9Jl, a) and (OT, 6). 
By induction on n. 

Base step: With round the initial match between distinguished objects must have 
failed to be a partial isomorphism for Spoiler to win. This implies that (9K, a) and (9T, b) 
disagree on some atomic formula. 

Inductive step: The induction hypothesis says that for every two structures, if Spoiler 
can win their comparison game over n rounds, then the structures disagree on some 
FO(TC^)-formula of quantifier depth n. Assume that for some structures (9K, a), (91, 6), 
Spoiler has a winning strategy for the game over n + 1 rounds. Let us reason on Spoiler's 
first move in the game. It can either be a TC or an 3 move. 

If it is an 3 move, then it means that Spoiler picks an element a in one of the two 
structures, so that no matter what element b Duplicator picks in the other. Spoiler has 
an n-round winning strategy. But then we can use the induction hypothesis, and find 
for each such b a formula ipb{x) that distinguishes (9Jl, a, a) from (9T, b, b). In fact we can 
assume that in each case the respective formula is true of (9K, a, a) and false of (9T, b, b) 
(by negating the formula if needed). Now take the big conjunction (p{x) of all these 
formulas (which is equivalent to a finite formula according to Lemma IS.lip and prefix 
it with an existential quantifier. Then the resulting formula is true in (9Jt, a) but false 
in (or, 6). It is true in (9Jt, a) if we pick a for the existentially quantified variable. And 
no matter which element we pick in (9^1,5), it will always falsify one of the conjuncts 
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in the formula, by construction. So, the new formula is false in (91,6). I.e., 3xip{x) of 
quantifier depth n + 1 distinguishes (Tl, a) and {^t, b). 

If Spoiler's first move is a TC-move, then it means that Spoiler picks a subset in one 
structure, let say A € Agjj (with ai (z A and aj ^ A), so that no matter which B £ Afrt 
(with hi £ B and hj B) Duplicator picks in the other structure. Spoiler can pick € B, 
bk+i B such that no matter which G A, Ofc+i ^ A Duplicator picks. Spoiler has 
an n-round winning strategy. For each B that might be chosen by Duplicator, Spoiler's 
given strategy gives a fixed couple 6^,6^+1. For each response 0^,0^+1 of Duplicator, 
we thus obtain by inductive hypothesis a discriminating formula ifB,ak,ak+i{x,y) that 
we can assume to be true in (91, b) for bk, 6fc+i and false in (9Jl, a) for ak, flfc+i- Now for 
each B, let us take the big conjunction ^3(^0, y) of all these formulas (which is finite, by 
Lemma IS.lip . We can then construct the big disjunction $(x,y) (again finite, by the 
same lemma) of all the formulas ^B{x,y)- 

Considering the first round in the game together with the inductive hypothesis, note 
that the MSO formula 3X{ai e X A aj ^ X A \/xy{{x e X A y ^ X) ^^{x,y))) 
holds in (9H, a). Indeed, by induction hypothesis, any couple € A,ak+i ^ A that 
Duplicator might choose in (iom(9Jl) will always falsify at least one of the conjuncts 
of each $B(x,y). Finally, the formula ^{x,y) being constructed as the disjunction 
of all the formulas ^Bix,y), any such couple a(fc,afc-|_i will also falsify ^{x,y). Now 
the MSO formula 3X{ai e X A aj ^ X A yxy{{x & X A y X) ^ ^^>(x,?/))) is 
equivalent to 3X{ai £ X Aaj ^ X A -'3xy{x £ X A ^{x, y) Ay ^ X))^ which means that 
(m,a)^[TC,y<^>ix,y)]iai,aj). 

On the other hand for the same reasons, note that it holds in (91,6) that VX((6j € 
X Abj ^ X) — 3xy{x £ X A y ^ X A ^{x,y))). Indeed, by induction hypothesis, for 
each B that Duplicator might choose in Af^ Spoiler will always be able to find a couple 
bk G B,bk+i ^ B satisfying all the conjuncts of the corresponding formulas ^B{x,y). 
Finally, the formula $(x, y) being constructed as the disjunction of all the formulas 
$s(a;,y), such a couple 0^,0^+1 will also satisfy ^{x,y). Now VX((6j £ X Abj ^ X) 
3xy{x £ X Ay ^ X A ^>(x, y))) is equivalent to yX{bi ^X\lbj £ X V 3xy{x £ X Ay ^ 
X A <^>{x,y))), which means that (91,6) \= [TCxy<^>{x,y)]{bi,bj). 

Let li be a name for the parameters aj,6j and v for bi,bj. [TCxy^{x,y)]{u,v) of 
quantifier depth n + 1 distinguishes (91, a) and (9Jl, 6). 

From the existence of a FO(TC^) formula of quantifier depth n distinguishing (OH, a) 
and (91, 6) we will infer the existence of a winning strategy for Spoiler in the game 
EF-o+^ci{Tl,a),im,b)). 
By induction on n. 

Base step: Doing nothing is a strategy for Spoiler. 

Inductive step: The inductive hypothesis says that, for every two structures, if they 
disagree on some FO(TC^) formula of quantifier depth n, then Spoiler has a winning 
strategy in the n-round game. Now, assume that some expanded structures (9Jt, a), 
(91,6) disagree on some FO(TC^) formula x of quantifier depth n + 1. Any such for- 
mula must be equivalent to a Boolean combination of formulas of the form 3xiIj{x) and 
[TCxyf{x,y)]{u,v) with tp, ip of quantifier depth at most n. If x distinguishes the two 
structures, then there is at least one component of this Boolean combination which 
suffices for distinguishing them. 

Let us first suppose that it is of the form 3x'il^{x). We may assume without loss 
of generality that (9Jl, a) |= 3xiIj{x) whereas (9t, 6) ^ 3xip{x). Then it means that 
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there exists an object a € (iom(9H) such that (971, a) \= ip{a) whereas for every object 
b € (iom(OT), (OT, 6) ^ ipib). But then we can use our induction hypothesis and find for 
each such b a winning strategy for Spoiler in EF^q^j^i^{{TI, a, a), 6, 6)). We can infer 

that Spoiler has a winning strategy in EF^q^qi^{{TI, a) , 6)). His first move consists 
in picking the object a in dom(JM) and for each response b in dom(91) of Duplicator, the 
remaining of his winning strategy is the same as in EF^q^j^i^{{TI, a, a), (^, b, b)). 

We now suppose that [TCxy^{x, y)] (u, v) of quantifier depth n+1 distinguishes the two 
structures. We may assume without loss of generality that {Tl,a) \= [TCxy(p{x,y)]{u,v) 
i.e. it holds in {M,a) that VX((ai e X A aj ^ X) 3xy{x € X Ay ^ X A ip{x,y))), 
whereas (9T,6) ^ [TCxy(p{x,y)]{u,v) i.e. it holds in (01,6) that 3X{bi £ X A bj 
X A -'3xy{x G X A ip{x,y) Ay ^ X)). We want to show that Spoiler has a winning 
strategy in ii^FpQ^^^ij((9Jt, o), (9T, 6)). Let us describe her first move. She first chooses 

(9T,6) and B e A<yi such that bi € B A bj ^ B A Sxy{x e B A v?(x,y) Ay ^ B). 
By definition of TC, such a set exists. Duplicator has to respond by picking a set 
A in Atjji containing Oj and not aj. Spoiler then picks G A and a^+i A such 
that (5Jt, a) \= 99(0^,0^+1). This is possible because by definition of TC, for any 
possible choice A of Duplicator (i.e., any set A containing Oj and not aj) we have 
3xy{x G A A y ^ A A if{x,y)). But that means that Duplicator is now stuck and 
has to pick b^ & B and bk+i ^ B such that (^,6) ^ ip{bk,bk+i)- Consequently, 
we have {%b,bk,bk+i) ^ V^ix,y), whereas (9^,0,0^,0^+1) |= (p{x,y). As ipix,y) 
is of quantifier depth n, by induction hypothesis. Spoiler has a winning strategy in 
i?FpQj-|-^ij((9JI, o, Ofc, Ofc+i), (9T, b, bki bk+i))- The remaining of Spoiler's winning strategy 

in iiJFpQ^^^ij((9K, o), (9T, 6)) (i.e. after her first move, that we already accounted for) is 

consequently as in £;F|?Q^-p^i^((9n, o, o^, o^+i), (9T, 6, 6^, 6^+1))- □ 

Corollary 5.19. For structures 9H, and n > 0, Duplicator has a winning strategy in 
i?FpQ^^^i^(9Jt, or) if and only if ^ ='^^^^^1^ OT. In particular, Duplicator has a winning 
strategy in all EF^q^jqij- games of finite length between 9JT and 97 if and only i/97t =fO(TC^) 

m. 

Let us finally consider the FO(LFP^) case. There are two classical equivalent syntactic 
ways of defining the syntax of FO(LFP^): the one we used in Section [2.21 and another one, 
dispensing with restrictions to positive formulas, but allowing negations only in front of 
atomic formulas and introducing a greatest fixed-point operator as the dual of the least 
fixed-point operator (also V cannot be defined using 3 and has to be introduced separately, 
similarly for the Boolean connectives). This second way of defining FO(LFP^) turns out to 
be more convenient to define an adequate Ehrenfeucht-Prai'sse game. The game is suitable 
to use on Henkin structures because the semantics on which it relies is merely a syntactical 
variant of the one given in Section H Now the FO(LFPi)-formulas [LFPxx^{x, X)]y and 
[GFPxxV'i^T ^)]y^ stating that a point belongs to the least fixed-point, or respectively, to 
the greatest fixed-point induced by the formula 99 satisfy the following equations: 

[LFPxxVix, X)]y ^ yXi^Xy ^ 3x{^Xx A ip{x, X))) 
[GFPxx^ix, X)]y o 3XiXy A Vx(Xx ^ ^{x, X))) 

Note that this holds no matter whether we be concerned with FO(LFP^) and MSO on 
standard structures or on Henkin structures. The consideration of these equations is the 
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key idea behind an Ehrenfeucht-Fraisse game defined by Uwe Bosse in [4] for least fixed- 
point logic FO(LFP) (i.e. where fixed-points are not only considered for monadic operators, 
but for any n-ary operator). FO(LFP^) being simply the monadic fragment of FO(LFP), 
the game for FO(LFP) can be adapted to FO(LFP^) in a straightforward way: 

Definition 5.20 (FO(LFPi) Ehrenfeucht-Fraisse game). Consider FO(LFP^)-Henkin struc- 
tures Tl and 01 together with a € dom{my , b € dom{mY, A G A^, 6 G A^, r > 0, s > 0, 
n > 0. In the game EF^Q^^^pi^{{Tl, A,a), B ,b)) of length n, there are two types of 
moves, point and fixed-point moves. Each move results in an extension of the assignment 
a 1-^ b, A 1-^ B with elements a^+i € dom{Tl), bs+i € (iom(OT), and possibly (in the case of 
fixed-point moves) with sets ^r+i G Br+i € A<yx. Spoiler chooses the kind of move to 
be played. Now the following moves are possible: 

• 3 move: Spoiler chooses Os+i E dom{Tl) and Duplicator 6^+1 G dom{^). 

• V move: Spoiler chooses bs+i G dom(9I) and Duplicator a^+i G dom(JM). 

In each point move, the assignment is extended by Us+i i-> fos+i- 

• LFP move: Spoiler chooses Br+i G Ag^ \ {dom(9I)} with some pebble bi B^+i and 
Duplicator responds with Ar+i G Agjt \ {dom{Tl)}. 

Now Spoiler chooses in dom{'*M) a new element Os+i ^ ^r+i and Duplicator answers 
in dom{^) with bg+i ^ -Br+i- 

• GFP move: Spoiler chooses Ar+i G A^ \ {dom{Tl)} with some pebble G ^4^+1 and 
Duplicator responds with Br+i G Af^ \ {dom(9T)} such that -Br+i ^ 0. 

Now Spoiler chooses in (iom(9T) a new element fts+i G -Br+i and Duplicator answers in 
(iom(9}t) with Ug+i G A^+i. 
In each fixed-point move the assignment is extended by A^+i ^ -Br+i, cts+i i— ^s+i- 

After n moves, Duplicator has won if the constructed element assignment a i—)- 6 is a 
partial isomorphism and for the subset assignment A B, for any 1 < j < r and i < s: 

Qi G Aj implies bi G Bj 

We call an assignment with these properties a posimorphism. 

Theorem 5.21 (FO(LFP^) Adequacy). Assume a finite relational FO(LFP^) language. 
Given two FO(LFP^)-Henkin structures and A e A^, 5 G B^^, a G dom{Tiy , 
b G dom{^Y and r > 0, s > 0, n > 0, Duplicator has a winning strategy in the game 
EF^Q^i_i,p^^{{Tl,A,d),{m,B,b)) iff{Tl,A,a) and im,B,b) satisfy the same FO(LFP^)-for- 
mulas of quantifier depth n. 

For a proof in the case of standard structures, we refer the reader to Uwe Bosse [3]. As 
pointed out earlier, the same argument works as well in the case of Henkin structures. 

5.2. Fusion Theorems on Henkin-Structures. Let A G {MSO, FO(TC^), FO(LFP^)}. 

We show our analogues of Feferman-Vaught theorem for fusions of A-Henkin-structures. We 
will refer to them as A.- fusion Theorems^ even though they will sometimes be formally first 
stated as corollaries. What we show is, more precisely, that fusion of A-Henkin-structures 
preserve A-equivalence for all fixed quantifier-depths. 

In order to give inductive proofs for MSO and FO(LFP^), it will be more convenient to 
consider parametrized A-Henkin-structures where the set of set parameters is closed under 
union, this notion being defined below. This is safe because whenever two parametrized 
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structures {9Jl,A,a) and are n-A-equivalent, it follows trivially that 9Jt and OT 

considered together with a subset of this set of parameters are also n-A-equivalent. 

Definition 5.22. Let Ai,...,Ak be a finite sequence of set parameters. We define the 

sequence (^i, . . . , A^.)^ as the finite sequence of set parameters obtained by closing the set 
{Ai, . . . , ^4^} under union, i.e., {Ai, . . . , A^)^ = {Uie/ ^il-^ ^ {Ij ■ ■ ■ ! k}}. (We additionally 
assume that this set is ordered in a fixed canonical way, depending on the index sets /.) 

Theorem 5.23 (Fusion Theorem for MSO). Let DJti and D^j be MSO-Henkin structures, 
where 1 <i < k. Furthermore, for 1 <i < k, letdi, bi be sequences of first- order parameters 

of the form ai^^, . . . , , bi-^, . . . , bi^ ( where m G N m,ay depend on i) and Ai, Bi sequences 
of set parameters of the form Ai^ , ■ ■ ■ , Ai^, , Bi^ , ■ ■ ■ , Bi^, (where m' G N may again depend 
on i ). Whenever 

{m.i,Ai,ai) =Mso {%,Bi,bi) for alll<i<k, 

then also 

f _ _ /____ 

fmi,(Ai,...,Afe)^,a-i,...,a-fc =^50 9Ii,(5i,...,^fc)^,6i,...,6fc. 

l<i<fc l<i<A; 

Proof. We define a winning strategy for Duplicator in the game 

/__ /____ 
^^MS0(( 9Jti,(li,...,^fe)U,a-i,...,a-fe),( OT,,(Si,...,Sfc)U,6i,...,6fe)) 

l<i<A; l<«<fe 

out of her winning strategies in the games £?F|5jgQ((*Hj, Ai,di), (9Tj, Bi,bi)) by induction on 
n. 

Base step: n = 0, doing nothing is a strategy for Duplicator. We need to show that 

/ _ _ 

( mi,{Ai,...,Akf,di,...,dk) 

l<i<k 

and 

/ 

( at,,(5i,...,Sfe)U,6i,...,6fc) 

l<i<fc 

agree on all atomic formulas. Now in the fusion structures, each atomic formula is de- 
fined by / in terms of a c7*-quantifier free formula that is evaluated in the corresponding 
disjoint union structure. So it is enough to show that the disjoint union structures agree 
on all atomic cj*-formulas and on their Boolean combinations. The initial match between 
the distinguished objects in {dJli,Ai,di) and (9^,Bj,6j) is a partial isomorphism for every 
1 < f < A;, so it is also one for 9?Tj, oi, . . . , and l+J-j^^,.^^, OTj, 6i, . . . , 6fc i.e. the 

two disjoint union structures extended with FO parameters agree on all (T*-atomic formu- 
las. We still need to show that it is also one for l+Ji<j<fe (^i, • • • , Ak)^,di, . . . , Ofe and 
Wi<i<A; (-^1' ■ ■ ■ ' Bk)^,bi, ... ,bk i.e. the two disjoint union structures extended with FO 
parameters and the closure under union of set parameters agree on all o"*-atomic formulas. 
It is enough to point that for every parameter Oj^. , for every / C {ii, . . . , i^i, . . . ,ki, k^'} 
by construction of IJie/ ^« (^i' • • • > ^fe)^' following are equivalent: 

• l±)l<i<fe^«'(^l'---'^ik)'^'«l'---'«fc N UiG/^*S' 
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• l±ii<i<fc^*' (^1' • • • )^fc)^)^jn'^i' • • • jOfc 1= Aittij for some ii in /. 

Similarly for every parameter bi. , by construction oi IJ^^j Bi in {Bi, B^)'^ , the following 
are equivalent: 

• l±)l<i<fc (^1, • • • > Bk)'', bi, . .. , h hU^eI 

• l±li<i<fc^i' (Bi,-- . ,Bk)^,Bi^,bi, ...,bk ^ Bi^bi^ for some ii in /. 

But by Duplicator's winning strategy in the small structure games, we know that the 
following are equivalent: 

• l±Ji<i<jfc (^i' ■ ■ ■ ' 4'^)'^' An «i' ■■■,ak\= Ai^Qi- for some ii in I. 

• l±li<j<fc (-Bi, • • • , -Bfe)^, Sj;, 6i, ...,bk\= Bifii. for some in I. 
So the following are also equivalent: 

• l±)i<j<fc^j) (^1, • • .,Ak)^,ai, ...,ak \= Uie/ A«*j' 

• Wi<i<fc (^1, • • • , ^fc)'', 6i, . . . , 6fc H Ue/ ^i^i.' 

So the two extended disjoint union structures agree on all (T*-atomic formulas. Now relying 
on the semantics of Boolean connectives, it can be shown by induction on the complexity 
of quantifier free sentences that they also agree on all Boolean combinations of atomic 
c7*-sentences. 

Inductive step: the inductive hypothesis says that whenever Duplicator has a winning 
strategy in EF^QQ{{^i, Ai,di), (OTj, Sj, bj)) for all 1 < z < A;, he also has one in 

/__ /____ 
^^MSo(( 2Jli, (li, . . . , Ak)"", ai, . . . , ofc), ( 01,, . . . , 6i, . . . , bk)). 

l<i<k l<i<k 

We want to show that this also holds when the length of the games is n + 1. Sup- 
pose Duplicator has a winning strategy in the game £'F|^gQ((9Jti, Aj, Oj), (9Tj, 5j, for 
all 1 < i < /c. We describe Duplicator's answer to Spoiler's first move in the game 

BFmoii®i<i<k 'M^,Ai,...,Ak,di,..., dk), {®(<i<k %,Bi,...,Bk,h,---, bk))- 

It will then follow by induction hypothesis, that he has a winning strategy in the remaining 

n-length game. 

• Spoiler's first move is a point move. Suppose Spoiler picks a in ©i<j<fcSC'li. Then 
a belongs to dom(JM,i) for some 1 < i < k. So Duplicator uses his winning strategy 
in EFf^QQ{{Wi,Ai,di),{'^i,Bi,bi)) to pick b G o?om(9Tj), so that he still has a winning 
strategy in EF^QQ{{Tli, Ai, di,a), (Otj, Sj, bi,b)). By induction hypothesis he also has one 
in the remaining n-length MSO game between the following two structures: 

/ _ _ 

( fOTi,(Ai,...,Afe)^,ai,...,afe,a) 

l<i<fc 

and 

/ 

( OT„(i?i,...,i?fc)^,5i,...,6fc,6) 

l<i<fe 

• Spoiler's first move is a set move. Suppose Spoiler chooses a set A in the set of admissible 
subsets of ©{<,j<^. QH-j. Then A is necessarily of the form Ai U ... U A^, with Ai an 
admissible subset of OJlj. We now define locally his response B = Bi U . . . U Bk, using 
his winning strategies in the small structures, so that he still has a winning strategy in 
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EF^QQ{{Tli, Ai, Ai,ai), {^i, Bi, Bi,bi)) for all 1 < i < fc. By induction hypothesis, he 
also has one in the remaining n-length MSO game between the following two structures: 

/ _ 

( Tli,{Ai,Ai,...,Ak,Ak)'^,di,...,ak) 

l<i<k 

and 

/ _ 

( mi,{Bi,Bi,...,Bk,Bk)'',b\,...,bk). 

l<i<k 

(Note that this is enough, because A £ (^i, ^i, . . . , Ak, A^)'^.) □ 

Now an analogue of this result for disjoint unions can easily be derived as a corollary of 
Theorem 15.231 For the convenience of the reader, we provide here the detailed argument: 

Corollary 5.24. Whenever {Wli, Ai,ai) =%qq {^i,Bi,bi) for all 1 < i < k (with di a 
sequence of first- order parameters of the form ajj, . . . ,aj„j with m S N and Ai a sequence 
of set parameters of the form Ai-^ , . . . , Ai^, with m' G N, similarly for the bi and Bi), then 
also \Si<i<k (^1, • • • , Ak)^,di, ...,dk =mso ^i<i<k i^i,---, Bk)^,bi, . . . , 6^. 

Proof. Let (Tli, Ai,di) =^qq {^i,Bi,bi) for all 1 < i < k (with di a sequence of first-order 
parameters of the form , . . . , with m G N and Ai a sequence of set parameters of the 
form Ai-^ , ■ ■ ■ , Ai^, with m' G N, similarly for the bi and Bi). 

Now consider the following expansions 9Jt^ and of the a structures QJlj and 9Tj to 
a* = aU {Qi, . . . , Qk}'- the interpretation of Qj is empty in Tl[ (respectively 01^ whenever 
i ^ j and it is the domain of Tl'^ (respectively DlQ whenever i = j. 

Clearly {m'^,Ai,d^) =1^qq {%,B,,hi) for slll<i<k. 

Now consider a mapping / such that for every n-ary predicate P G o"*, f{P) = 
Pxi . . .Xn- By Theorem 15.231 we have that 

/__ f _ _ _ _ 

m'i, (^, . . . , ai, . . . , Ofc =;^so 91^51, ... , BkVM, ■ ■ ■ , bk. 

l<i<k l<i<k 

Corollary 15.241 follows, because 

/__ f _ _ _ _ 

m'i,{Ai,..., ^)^, ai, ...,dk and %,{Bi,..., 5^)^, bi,..., b'k 

l<i<k l<i<k 

are isomorphic (w.r.t. a) to 

[+J Tli, {Ai,...,Ak)'~',di,...,dk and l+J n,, {Bi, . . . , Bk)"" ,bi, . . . ,bk 

l<i<k l<i<k 

respectively. □ 
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Another important corollary of Theorem 15.231 is the fact that fusions of MSO-Henkin 
structures are also MSO-Henkin structures. Let us stress the importance of this fact, which 
is needed for the correctness of our main completeness argument. 

Corollary 5.25. -^0/ tjjt ^-^ closed under MSO parametric definability and so ©{<j<^ 
is a MSO-Henkin structure. 

Proof. First note that the following are equivalent: 

• B is MSO parametrically definable in 971, 

• for some n, there is a finite sequence of parameters o, A such that B is defined by a MSO 
formula ip of quantifier depth n using o, A, 

• for some n, for every two points a, a' E (iom(9K), if they are MSO n-indistinguishable 
using a, A, then a B iS a' €z B. 

Now suppose for the sake of contradiction that there is 5 C c?om(^:[^ -^^ MSO para- 
metrically definable in ©{<j<fc9?^i using a', A', but B ^ ^(f,f <m.- So it means that for 

some l<i<k, Ai = Br\ dom(dJti) is not MSO parametrically definable in 9Jlj i.e. there 
are two MSO parametrically indistinguishable points a G B, a' ^ B. So for all n, for all 
sequence of parameters a, A in Tli, 

{Mi, a, A, a) =J^so {'^ua,A,a') 

and by the fusion theorem 

/ ___ / 

mi,a,A,a',A',a=lf^Qo mi,a,A,a',A',a' 

l<i<k l<i<k 

But this entails that B is not MSO parametrically definable in 0{<j<;i, 9?Tj using a', A', 
which is a contradiction. □ 

Corollary 5.26. Ay^^.^^^, is closed under MSO parametric definability and so l+lx<j<fc 
is a MSO-Henkin structure. 

Proof. Analogous to the proof of Corollary 15.251 (as = ''^I+Ik <fc I— ' 

Let us now consider the FO(TC^) case. As TC moves can only be played when there 
are already two pebbles on the board, it is more convenient to show first a version of our 
FO(TC^) fusion theorem in which each small structure comes with at least two parameters. 
This allows us to define Duplicator's answer to a TC move played in a big structure, by 
means of his winning strategies in the corresponding small structures. We then derive as a 
corollary the fusion theorem for non-parametrized structures. 

Theorem 5.27 (Fusion Theorem for FO(TC^)). Let 9Jlj and be FO(TC^)-Henkin struc- 
tures, where 1 < i < k. Furthermore, for 1 < i < k, let di, hi be sequences of first- order 
parameters of the form aj^,...,ai^, bi-^, . . . ,bi^ (where m S N may depend on i), where 
each sequence di (or bi) contains at least two distinct elements, unless the structure 9Hj 
(respectively, has only one element. Whenever 

(jMi,di) ="^0(70^) (^i'^i) f""^ alll <i <k, 

^There is no need to consider the case where a', A' is empty, because if a set is parametrically definable 
using no parameter, it is also definable using parameters. 
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then also 

f f _ _ 

DJlj, oi , . . . , a)j =po(TC^) ^1 ' • • • ' ^fe- 

l<i<ifc l<i<fc 

Proof. We define a winning strategy for Duplicator in the game 

/ / _ _ 

^^FO(TCi)(( © ai, . . . , dk), (0 ^, 6i, . . . , h)) 

l<i<k l<i<k 

out of her winning strategies in the games EFpQ^j^i^{(JMi,di), (0Tj,6j)) by induction on n. 

Base step: n = 0, doing nothing is a strategy for Duplicator. We need to show that 
the ©{<j<fe SPTj, oi, . . . , Ofe and ©f<j<fc bi, . . . ,bk agree on all atomic formulas. Now in 
the fusion structures, each atomic formula is defined by / in terms of a o"*-quantificr free 
formula that is evaluated in the corresponding disjoint union structure. So it is enough to 
show that the disjoint union structures agree on all atomic cr*-formulas and on their Boolean 
combinations. The initial match between the distinguished objects in (9Jtj,ai) and 
is a partial isomorphism for every 1 < i < /c, so it is also one for l+lx<j<,t 93Tj, oi, . . . , Ofc and 
Wi<i<fc bi, ■ ■ ■ ,bk i.e. the two disjoint union structures agree on all (T*-atomic formulas. 
Now relying on the semantics of Boolean connectives, it can be shown by induction on the 
complexity of quantifier free sentences that they also agree on all Boolean combinations of 
atomic cj*-sentences. 

Inductive step: the inductive hypothesis says that whenever Duplicator has a win- 
ning strategy in the game EF^Q^jQi.^{{Tli,di),{Vli,bi)) for some (SPtj, Oj), (OTj, bj) satisfy- 
ing the required conditions on parameters and 1 < i < fc, he also has one in the game 
E^FO(TC-)(i®L<k ai, • • ■ , a-k), (ef<,<fc 0T„ 6i, . . . , bk)). 

We want to show that this also holds whenever the length of the game is n + 1. 
Suppose Duplicator has a winning strategy in the game EF^Q^Qi^{{d)li,di), {^i,bi)) for 
ail 1 < i < k. We describe Duplicator's answer to Spoiler's first move in the game 
EF;+i^ij((©f<.<^ m,,du..., dk), (0{<i<fc OTi, bi, h)). It will then follow by indue- 
tion hypothesis, that he has a winning strategy in the remaining n-length game. 

• Spoiler's first move is an 3 move. Let Spoiler choose a point a € c^om(©:[^.^^ 
then a € (iom(9Jlj) for some 1 < i < k. So Duplicator can use his winning strategy 
in £JFpQ^-|-|-,ij((9Jti, Oj), {Vli,bi)) and pick a corresponding point b in the other structure. 

Now he still has a winning strategy in EF^Q^jQi^{{'[Oti, di, a), (OTj, 6^, b)). So by induction 
hypothesis he also has one in the remaining n length game 

/ / _ _ 

^^FO(TCi)(( © «!' • • • ' ( © ^1' • • • ' b)). 

l<i<k l<i<k 

• Spoiler's first move is a TC move. Suppose Spoiler chooses a set A in the set of admissible 
subsets of ©i<j<jfc9?^j- Then A is necessarily of the form Ai U . . . U A^, with Ai an 
admissible subset (possibly empty) of OJtj. Her response B = Bi U . . . U can now be 
defined locally for each Bi using her winning strategies in the small structures. So let 
Spoiler choose A = U . . . U . Keeping in mind that each non single point small 
structure comes with at least two distinct parameters, there are four cases: 
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a) in (iom(S[Ri), there is a distinguished object inside, but also outside Ai, so Duplicator 
considers Ai together with these two parameters and constructs Bi by using his winning 
strategy in EFp+^^^,^{{Tli, di), {mi, bi)). 

b) in dom(jMi), only distinguished objects exist inside aJ^, so Duplicator considers any 
one of these distinguished objects, say aj, and looks at Ai\{aj} together with some pa- 
rameter inside Ai. Then he can use his winning strategy in EF^Q^^-i^^{{d}li, di), (OTj, bi)) 
to construct an answer that we call B'-. Now Bi = B'- U 

c) in (iom(9Hi), only distinguished objects exist outside AiQ so Duplicator similarly con- 
siders some distinguished object Uj and looks at Ai U {oj} together with some other 
parameter outside Ai, so that he can construct an answer that we call B'^ by using his 
winning strategy in EF^+\^,^{{mi,di), Now Bi = Bl\{bj}; 

d) Tli is a single point structure, then i?j = if ylj = and Bi = dom{Tli) if Ai = 
dom(9Tj). 

Once B = BiU . . .L) Bk has been constructed. Spoiler picks two points b € B and b' ^ B. 
There are two cases: 

1. b and b' belong to the domain of one and the same small structure 9Tj ; now dom(jMi) 
is as previously described in a),b),c) (but not d)), because two distinct points cannot 
belong to one and the same single point structure) and in each case Duplicator does 
the following: 

a) He uses his winning strategy in the game di), (O'lj, bi)) to answer with 
a, a', so that he still has a winning strategy in EF^Q^jQi^({Tli,di, a, a'), (OTj, bi, b, b')). 
By induction hypothesis he also has one in the remaining n length game 

/ / _ _ 

^^FO{TCi)(( ai, • • • , Ofc, a, a'), (0 01^, . . . , b'k, b, b')). 

l<i<k l<i<k 

b) Suppose initially that b' ^ bj. Now Duplicator considers Ai\{aj} together with aj 
and some other parameter inside this set. Then he uses his winning strategy in 
EF^Q^^j^^{{Tli, di), (OTj, bi)) to pick corresponding a, a' in so that he still has a 

winning strategy in EF^Q^jQi^{{Tli,di,a, a'), bi, b, b')). By induction hypothesis 
he also has one in the remaining n length game 

/ / _ _ 

^^FO{TC^)(( ai' • • • ' «> «'), (0 "^uh,..., bk, b, b')); 

l<i<k l<i<k 

Next, suppose b — bj. Then we choose a — aj. The parameter aj already matches 
b i.e., Duplicator has a winning strategy in 

EF^O(TC^)im,ai,a),{mi,bi,b)) 

that he may use to pick a', thus answering as if it was a point move (i.e., a' has 
to be n-equivalent to b'). Therefore Duplicator still has a winning strategy in 
i?FpQ^-|-^ij((9Jtj, di, a, a'), (9Tj, bi, b, b')). By induction hypothesis he also has one in 



Note that as a special case we may have At — dom(9Jli 
^Note that as a special case we may have Ai = 0. 
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the remaining n length game 

/ / _ _ 

^^FO(TCi)(( © oi, . . . , Ofe, a, a'), (0 %,h,..., b'k, b, b')). 

l<i<k l<i<k 

This works, except that there is the additional condition a' ^ Ai that Duplicator 
must also maintain in order to respect the rules of the game. A slightly more refined 
argument shows, however that there has to be an n-equivalcnt point to b' which is 
outside Ai. Indeed, instead of b, Spoiler could have picked any other point b* € Bi 
together with b' ^ Bi and Duplicator's winning strategy would have provided a 
correct answer a* e Ai, a' ^ A^, which means that Duplicator would have found 
some point a' which is at least n-equivalent to b' and lies outside Ai (because if 
Duplicator has a winning strategy in i?FpQ^-|-^ij((9Jtj, a^, a*, a'), 6.j, 6*, 6')) then 

he has one in EF^Q^j^i^{{Tli,di,a'),{^i,bi,b')) as well, and consequently also in 

^^FO(TCi)((^*' «' (^n h, b, b'))). 

c) Suppose initially that b ^ bj. Then Duplicator considers Ai U {aj} together 
with aj and with some other parameter outside this set and uses his winning 
strategy in EF^Q^Qj^^{{Tli,di), so that he still has a winning strategy in 

EF^Q^jQi^{{Tli, di,a, a'), bi,b, b')). By induction hypothesis he also has one in 
the remaining n length game 

/ / _ _ 

^^FO(TC^)(( © oi, . . . , Ofe, a, a'), (0 01^, 6i, . . . , 6^, 6, b'))- 

l<i<k l<i<k 

otherwise b' = bj, then a' = aj because the parameter aj already matches b' i.e., 
Duplicator has a winning strategy in 

^^FO(TCi)((^*' ^')), 
SO we can show by a similar argument as above that he can use it to pick a & Ai, 
so that he still has a winning strategy in £JFpQ^-|-^ij ((OJtj, Oj, a, a'), (91^, bi, b, b')). By 
induction hypothesis he also has one in the remaining n length game 
/ / _ _ 

^^FO(TCi)(( © oi, . . . , Ofe, a, a'), (0 01^, 6i, . . . , b, b')). 

l<i<k l<i<k 

otherwise b G dom(yii, bi) and b' G dom{^j, bj) with i / j; we can again use a similar 
argument to show that Duplicator can use his winning strategy in 

EF^+l^^,^{{mi,di),{mi,bi)) and EF;+^^^,^mj,dj),{mj,bj)) 

to pick a, a' in the right part of the structure (that is, inside or outside Ai), so that 
he still has a winning strategy in the games 

-^-^FO{TCi)((^*' ^^i' ^^fO(jc^)ii^j''^j^O'')^ i^j'bj, b')) (in the special 

case where for instance, Tlj is a single point structure. Duplicator picks the only 
available point in the other structure) . By induction hypothesis he also has one in the 



32 



A. GHEERBRANT AND B. TEN GATE 



remaining n length game 

/ / _ _ 

^^FO(TCi)(( © ai, • • • , Ofc, a, a'), ( ^Ti, h, . . . , h, b, b')). □ 

l<i<fc l<i<k 

We now show a corollary of the preceding lemma, in which the small structures do not come 
with any distinguished objects: 

Corollary 5.28. Whenever Tli =^^^^^1^ OTj for alll < i < k, then also 0{<j<fc Tli =^^^^^1^ 

®i<i<fc 

Proof. We know that Spoiler's first two moves in the FO(TC^)-game of length n + 1 between 
<i<k ^^"^ ©i<i<fc ™ust be quantifier moves, because the TC move can only be 
played once there are two pebbles on the board. Let us look at the first move. Suppose 
Spoiler plays a point a G dom{^^^.^j^Tli). So a G dom{Tli) for some 1 < i < k. By Du- 
plicator's winning strategy in EF^Q^jQ^^{Tli,^i), he has an answer b G dom(9Ij) such that 
{Tli, a) =FO(TCi) (^*'^)- rename a with and b with Similarly, for every j ^ i 

such that 1 < j < fc, fix some random point aj-^ coming from the domain oiTlj, Spoiler could 
have played this point and so Duplicator would have had an adequate answer bj-^ such that 
(£DTj,Ojj) =Pq^-|-^ij {ytj,bj-^). Now for the second round in the game, some point a' = ai^ or 

b' = bi^ coming from the domain of respectively SfJt; or 91/ will be played by Spoiler and Du- 
plicator will be able to answer so that (9Jt;, a;^ , ai^) =^'^^^^^ (*Jl/, 6/^ 1^12) ■ Similarly, for each 

Tlj such that j / /, we can find points such that , aj^) =^"^^^^1^ (^^Ij, bj^ , bj^). Now as 

for all 1 < z < /c, Duplicator has a winning strategy in EF^~'^^^^{{Tli, ,ai^), (OTj, bi^ , bi^)), 
by the previous lemma, he has one in 

/ / 
^^FO(JC^)^ 3^i,aii,ai2, . . . ,0^1,0^2), ( Crtj, 61^,612, . . . ,6^1,6^2)), 

l<i<A: l<j<fc 

so he also has one in SF^^^ ^^^^(©{^.^^ 9Jti, a, a'), (©(<,<fc 6, b')). □ 

Corollary 5.29. Whenever Tli ='^q^jq^^ /or all 1 < i < k, then it also holds that 
Wi<i<fc ^« ^FOfTCij Wi<j<fc 

Proof. Analogous to the proof of Corollary 15.241 □ 
Corollary 5.30. A^/ ™ is closed under FO(TC^) parametric definability and so the 
structure ©{<j<j!j 2Rj is a FO(TC^)-Henkin structure. 

Proof. Analogous to the proof of Corollary I5.25[ □ 

Corollary 5.31. Ay^^.^^gj;. is closed under FO(TC^) parametric definability and so the 
structure 1+Ji<j<fc9?ti is a FO(TC^)-Henkin structure. 

Proof. Analogous to the proof of Corollary 15.261 □ 
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In the FO(LFP^) case, the situation parallels the FO(TC^) case. As LFP moves can 
only be played when there is already one pebble on the board, it is more convenient to show 
first a version of our FO(LFP^) fusion theorem in which each small structure comes with 
at least one FO parameter. This allows us to define Duplicator's answer to a LFP move 
played in the big structure, by means of his winning strategies in the small structures. We 
then derive as a corollary the fusion theorem for non-parametrized structures. 

Theorem 5.32 (Fusion Theorem for FO(LFP^)). Let di, bi be non empty sequences of first- 
order parameters of the form a^^ , . . . , Oj^ , , . . . , bi^ , with m G N and Ai, Bi sequences of 
set parameters of the form Ai^ , ■ ■ ■ , Ai^, , Bi^ , . . . , Bi^, with m' G N. Whenever 

{Tli,Ai,ai) =FO(LFP^) i^i^^i^bi) for alll <i <k, 

then also 

f _ _ / _ _ __ 

{Ai, ai, . . . , Ofe =''fo(LFP^) %,{Bi,..., Bkf, h, bk. 

l<i<k l<i<k 

Proof. We proceed by induction on n, defining a winning strategy for Duplicator in the game 

^^FO(LFPi)((®^<*<fc (^1' • • • ' ^kT^'^l^ • • - (©■f<.<fc (^1, • • • , BkT. h,..., bk)), 
out of her winning strategies in the games EF^Q^^^^pi^[{dyti,Ai,di), (DTj, 6j)). 

Base step: n = 0, doing nothing is a strategy for Duplicator (this can be justified by a 

similar argument as in the MSO case). 

Inductive step: the inductive hypothesis says that whenever Duplicator has a winning 
strategy in EF^Q^^^pi^{{Tli,Ai,di), (91^, Bi, bi)) for pairs of structures {Tli,Ai,di), (91^, Bi, bi) 
satisfying the required conditions on parameters with 1 < z < A;, he also has one in 
E^P0(LFP^)(i®{<i<k . . . , ^fe)^, Oi, . . . , dk), (©{<,<fe Oti, (Si, . . . , 6i, . . . , h)). 

We want to show that this also holds when the length of the games is n+1. Suppose Du- 
plicator has a winning strategy in EF^Q^^^pj^^{{d}li, Ai, di), (CTlj, Bj, 6j)) for all 1 < z < A;. Wc 

describe Duplicator's answer to Spoiler's first move in the FO(LFP^)-game of length n-|- 1 in 
between (0(<j<fc ^Xlti, (^i, • • • , ^fe)*^, oi, . . . , dk) and (0(<j<fc {Bi,..., Bk)^,bi, bk). 
It then follows by induction hypothesis, that he has a winning strategy in the remaining 
n-length game. 

• Spoiler's first move is an 3 move. 

Same argument as for MSO and FO(TC^). 

• Spoiler's first move is a V move. 

Symmetric. 

• Spoiler's first move is a GFP move. 

Suppose Spoiler chooses a set A in the set of admissible subsets of 0{<j<;j 9Jti with 
some pebble ai. € A. Then A is necessarily of the form AiU. . .U^^, with Ai an admissible 
subset of 9Jtj. Her response B = BiU . . .UBk can now be defined locally for each Bi using 
her winning strategies in the small structures. So let Spoiler choose A = AiU . . . U Ak. 
Keeping in mind that each small structure comes with at least one parameter, there are 
four cases: 

1) in dom{Tli), there is a distinguished object inside Ai and Ai ^ dom{Tli), so Duplicator 
considers Ai together with this parameter and constructs Bi by using his winning 
strategy in EF^+^^pp,^{{^mi, A, di), {%, Bi, bi)). 
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2) in dom(Tli), there are only distinguished objects outside Ai / 0, so Duphcator con- 
siders any aj among those and looks at Ai U {aj}, so that he can use his winning 
strategy in E'Fpj^^|^ppi^((9JIj, ^j, aj), (OTj, Sj, to construct an answer that we call 
B'-. Now Bi = B'j\{bj}. This is a correct answer, because the (posimorphism) condi- 
tion to be maintained (see Definition I5.20p is that for every pebble ai on the board 
at the end of the game, ai & Ai ^ bi € Bi. But by Duplicator's winning strategy in 
E'FpQ^j^Ppi J {{Tli , Ai, AiU{aj},ai), ,Bi,B[,bi)), we know already that for every such 

pebble, a; € U {aj} ^ bi G B'^, so also a; G =^ 6/ € Bl\{bj}, since the winning 
conditions will assure that = aj if and only if bi = bj. 

3) Bi = domiJMi). So Ai = dom{yti). As pebbles are only chosen using Duplicator's 
winning strategies in the small structures, the posimorphism condition will be main- 
tained. 

4) Bi = 0. So Ai = 0. As no pebble can belong to this set, the posimorphism condition 
will be maintained. 

Now that B = BiL) . . .yjBk has been constructed, Spoiler picks a new element b € B which 
belongs to the domain of one particular small structure Vti (so b G Bi) and dom{Tli) is 
as previously described either in 1, 2 or 3 (but not 4), because b cannot belong to the 
empty set) and in each case Duplicator does the following: 

1) Duplicator answers with a according to his winning strategy in 

^^FO(LFpi ){m,A,di),{mi,Bi,bi)); 

2) Duplicator again considers Ai U {aj} and answers according to his winning strategy in 
£'FpQ^j^Ppij((9JTj, ^j, Ai U {aj},di), {^,Bi, B[, bi)). This is safe, because the pebble to 
be chosen may be assumed to be fresh, so it won't be a^; 

3) Duplicator picks a random pebble aj G dom(9Jtj) and considers dom(9Jtj)\{aj}. His 
winning strategy provides him with a correct answer. 

So in any of these cases (either 1, 2 or 3), Duplicator has a winning strategy in the game 
-Ei^FO(LFPi)((^«' (^i' ^))- J 7^ 1 < J < ^) lie also has 

one in EF^Q^^^p^^{{^j,Aj,Aj,dj),{^j,Bj,Bj,bj)). So by induction hypothesis, he has 
one in the remaining n-length FO(LFP^) game between the following two structures: 

/ _ 

( Tli,{Ai,Ai,...,Ak,Ak)'' ,ai, . . . ,ak,a) 

l<i<k 

and 

/ _ 

( ^i,{Bi,Bi,...,Bk,Bk)'',bi,...,bk,b) 

l<i<k 

• Spoiler's first move is a LFP move. 

Symmetric. □ 

Corollary 5.33. Whenever DJli =^Q(i_ppij for all 1 < i < k, then it also holds that 

®f irm. =n n\f m. 

l<«<fc * ~FO(LFP^) ^l<i<k 

Proof. We know that Spoiler's first move in £'Fp^^j^ppij(0^^^^^ SUtj, ^^Ij) must be a 

FO quantifier move, because the LFP move can only be played once there is a pebble on the 
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board. Let us look at the first move. Suppose Spoiler plays a point a G do"^(©f<j<fe So 
a G dom{Tli) for some 1 <i < k. By Duplicator's winning strategy in EFpQ^^^pi^{Tli,^i), 
he has an answer b G dom(9Tj) such that {DJti,a) =^Q^^ppi^ Let us rename a with 

Oj and b with 6j. Similarly, for every j ^ i such that 1 < j < A;, fix some random point 
Qj coming from the domain of OJtj, Spoiler could have played this point and so Duplicator 
would have had an adequate answer bj such that {Tlj,aj) =po{LFpi) (^i'^i)- Now as for 
all 1 < i < k, Duplicator has a winning strategy in EFp~^^^ppi^{{Tli, Ui), {^i,bi)), by the 
previous lemma, he has one in E F^~^^pp^^{^{^.^i^ 2)1^, ai, . . . , a^), (0{<j<fe ^i, 6i, . . . , 6^)), 
so he also has one in -E-FpQjJ_ppij(0(<j<;, 

Corollary 5.34. Whenever Tli =fQ^ifpij ^« for all 1 < i < k, then it also holds that 
Wi<i<fc =^0(LFP^) Wi<i<fc hi- 
proof. Analogous to the proof of Corollary 15.241 □ 
Corollary 5.35. A^/ is closed under FO(LFP^) parametric definability and so the 
structure ©{<j<^ is a FO(LFP^)-Henkin structure. 

Proof. Analogous to the proof of Corollary 15.251 □ 
Corollary 5.36. Ay^^^^gj;. is closed under FO(LFP^) parametric definability and so the 
structure \Si<i<k FO(LFP^)-Henkin structure. 

Proof. Analogous to the proof of Corollary 15.261 □ 



6. Putting it Together: Completeness on Finite Trees 

6.1. Forests and Operations on Forests. In Section 16.21 we will prove that no A- 
sentence can distinguish A-Henkin-models of h^*^*^ from standard models of h^*^*^. More 
precisely, we will show that for each n, every definably well-founded A-quasi-tree is n-A- 
equivalent to a finite tree. In order to give an inductive proof, it will be more convenient 
to consider a stronger version of this result concerning a class of finite and infinite Henkin 
structures that we call quasi- forests. In this section, we give the definition of quasi-forest 
and we show how they can be combined into bigger quasi-forests using the notion of fusion 
from Section [5j Whenever quasi forests are finite, we simply call them finite forests. As a 
simple example, consider a finite tree and remove the root node, then it is no longer a finite 
tree. Instead it is a finite sequence of trees, whose roots stand in a linear (sibling) order^ It 
does not have a unique root, but it does have a unique left-most root. For technical reasons 
it will be convenient in the definition of quasi forests to add an extra monadic predicate R 
labeling the roots. 

Definition 6.1 (A-quasi- forest). Let T = {dom{T), <, ^, Pi, ...... . Pn, At) be a A-quasi- 
tree. Given a node a in T, consider the A-substructure of T generated by the set {x \3z{a ^ 
z Az < x)}, which is the set formed by a together with all its siblings to the right and their 

^Note that, as far as roots are concerned, two nodes can be siblings without sharing any parent. This 
would not happen in a quasi tree. 
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descendants. The A-quasi-forest Ta is obtained by labeling each root in this substructure 
with R {Rx <^def y < x). Whenever T is a tree, we simply call Ta a forest. 

We will show in our main proof of completeness that for each n and for each node a 
in a definably well-founded A-quasi-tree, the A-quasi-forest Ta is n-A-equivalent to a finite 
forest. Our proof will use a notion of composition of A-quasi- forests which is a special case 
of fusion. Given a single node forest Fi and two A-quasi- forests F2 and i<3, we construct 
a new A-quasi- forest 0^ (Fi,F2,F3) by letting the unique element in Fi be the left-most 
root, the roots of F2 become the children of this node and the roots of F^ become its 
siblings to the right. We then derive a corollary of the A-fusion theorem for compositions 
of A-quasi- forests and use it in Section 16.21 

Definition 6.2. Let a = {<, R, Pi, . . . , Pn}, be a relational vocabulary with only monadic 
predicates except < and ^. Given three additional monadic predicates Qi,Q2,Q3, we define 
a mapping from a to quantifier-free formulas over a U {Qi,Q2,Q3} by letting 
. f^{Pi) = Pi{xi) 

• /^(<) = Xi < ^2 V (gi(xi) A Q2{X2)) 

• f^i^) = XI ^ X2 V (Qi(xi) A Q3ix2) A R{X2)) 

• f^{R) = (Qsixi) A i?(xi)) V Qi(xi) 

Corollary 6.3. Let Fi be a single node forest and F2, F3 A-quasi forests. If F2 =^ F2 and 
F3 Fi then e-^^(Fi,F2,F3) {Fi,F^,Fi). 




Figure 6: A composition of forests using the mapping / 

Figure [6] represents a composition of three forests Fi, F2, -F3 which uses the mapping 
Only new <c/i-arrows are represented, linking the unique node in Fi to the root nodes 
in F2. But new ^-links have also been added and the roots in F3 have became the siblings 
to the right of the root in Fi. This is implicitly indicated by the left to right organization 
of the picture. 

6.2. Main Proof of Completeness. 

Lemma 6.4. For all n gN, every definably well-founded A-quasi-tree of finite signature is 
n-A-equivalent to a finite tree. In particular, a A-sentence is valid on definably well-founded 
A-quasi-trees iff it is valid on finite trees. 
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Proof. Let T be a A-quasi-tree, without loss of generality assume that a monadic predicate 
R labels its root (and only that node in the tree). During this proof, it will be convenient 
to work with A-quasi- forests. Note that finite A-quasi- forests are simply finite forests and 
finite A-quasi-trees are simply finite trees (cf. Proposition 13.11 for the case of quasi-trees, 
from which the case for quasi- forests follows immediately). Let X„ be the set of all nodes 
a of T for which it holds that Ta is n-A-equivalent to a finite forest. We first show that 
'belonging to X„" is a property definable in T (Claim 1). We then use the induction scheme 
to show that every node of a definably well-founded A-quasi-tree (so in particular, the root) 
has this property (Claim 2). 

Claim 1: Xn is invariant for n + 1-A-equivalence (i.e., (T, a) (T, b) implies that a G Xn 

iff 6 G Xn), and hence is defined by a A- formula of quantifier depth n + 1. 

Proof of claim. Suppose that (T, a) =n+i (^;^)- We will show that Ta =^ T;,, and hence, 
by the definition of a G X^ iff 6 G X^- By the definition of A-quasi-forests, dom(Ta) = 
{x I 3z{a ■< z f\ z < x)}. Let (p be any A-sentence of quantifier depth n. We can assume 
without loss of generality that ip does not contain the variables z and x (otherwise we can 
rename in ip these two variables). By lemma [531 (^j ^) N REL{(p, 3z{a :< z A z < x),x) iS 
Ta \= (p. Notice that REL{(p, 3z{a :< z Az < x),x) expresses precisely that ip holds in (T, a) 
within the subforest Ta- Moreover, the quantifier depth of REL{(p, 3z{a :< z A z < x) is at 
mostn+1. It follows that (T, a) ^ REL{ip,3z{a ^ zAz < x),x) iff (T, 6) \= REL{p,3z{b ^ 
z A z < x),x), and hence Ta \= p iS Tf, \= p. 

For the second part of the claim, note that by Lemma I5.1H up to logical equivalence, 
there are only finitely many A-formulas of any given quantifier depth, as the vocabulary is 
finite. H 

Claim 2: If all descendants and siblings to the right of a belong to X„, then a itself belongs 
to Xn- 

Proof of claim. Let us consider the case where a has both a descendant and a following 
sibling (all other cases are simpler). Then, by axioms T3, T5, T8, T9 and TIO, a has a first 
child 6, and an immediate next sibling c. Moreover, we know that both h and c are in X„. 
In other words, and Tc are n-A-equivalent to finite forests T^ and T'^. Now, we construct 
a finite A-quasi-forest by taking a /^-fusion of T^', T'^ and of the A-substructure of T 
generated by {o}, whose unique element becomes a common parent of all roots of T^ and a 

left sibling of all roots of T^. So we get = 0-' (T \ {a}, T^', T'^)). It is not hard to see that 

is again a finite forest. Moreover, by the fusion theorem, 0-^ (T \ {a},Ti,,Tc)) =^ T'^. 

Now to show that 0-^ (T \ {a}.,Tb,Tc)) is isomorphic to Ta (which entails Ta =n T'a i.e. 
Ta is n-A-equivalent to a finite forest), it is enough to show A^^ = ^'^^/a . It 

holds that , a , , , , C Aj' because we can define in Ta each such union of sets by 

means of a disjunction. Now to show A^^ C A^a , take A G Aj-^, so A = XflTa 

for some X C Ay. As dom(T \ {a}), dom(Tf,) and dom{Tc) are all definable in T, the 
intersection of each of these sets with A is also definable in T and hence A D dom(T \ {a}) 
is definable in dom(T \ {a}), A D dom(Tf,) is definable in Tf, and A fl dom{Tc) is definable 
in Tc. But then ^4, which can be formed as the union of these three sets, is also definable 
in A_ f A 
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It follows from these two claims, by the induction scheme for definable properties, that 
Xn contains all nodes of the A-quasi-tree, including the root, and hence T is n-A-equivalent 
to a finite tree (to a finite forest actually, but the root of the A-quasi-tree being labeled by 
R, it can also be viewed as a A-quasi-forest). For the second statement of the lemma, it 
suffices to note that every A-sentence has a finite vocabulary and a finite quantifier depth. □ 

Theorem 6.5. Let A € {MSO, FO(TC^), FO(LFP^)}. The A-theory of finite trees is com- 
pletely axiomatized by . 

Proof. Theorem 16.51 follows directly from Lemma 16.41 and Corollary 14.181 □ 



6.3. Definability of the Class of Finite Trees. Proposition 16.61 below shows together 
with Theorem 16. 51 that on standard structures, the set of h^^^ consequences actually defines 
the (not FO-definable) class of finite trees. That is, has no infinite standard model at 
all. 

Proposition 6.6 ([22]). Let A G {FO(TC^), FO(LFP^), MSO}. On standard structures, 
there is a A-formula which defines the class of finite trees. 

Sketch of the proof. It is enough to show it for A = FO(TC^). It follows by Section [2.31 that 
it also holds for MSO and FO(LFP^). We merely give a sketch of the proof. For additional 
details we refer the reader to [22j. 

Recall from Proposition 13.11 that, on finite standard structures, the finite conjunction 
of the axioms Tl-TlO in Figure [5] defines the class of finite trees, i.e., any finite structure 
satisfying this conjunction is a finite tree. Now we will explain how to construct another 
sentence, which together with this one, actually defines on arbitrary standard structures 
the class of finite trees. Let L be a shorthand for the formula labeling the leaves in the 
tree {Lx ^def x < y) and R a shorthand for the formula labeling the root {Rx <^def 
-i3y y < x). Consider the depth-first left-to-right ordering of nodes in a tree and the 
FO(TC^) formula ip{x,y) saying "the node that comes after x in this ordering is y": 

ip{x, y) -.'^ (-iLx A X <ch y A -i3zz ^ y) V (Lx A x -<ns v) V {Lx A Szx -< z A 3z{z < 
X A z y A -iBuiw < X A z < w A 3uw u)) 

There is also a FO(TC^) formula which says that "x is the very last node in this ordering". 
ip{x, y) can be combined with this formula into an FO(TC^) formula x expressing that the 
tree is finite by saying that (we rely here for the interpretation of x on the alternative 
semantics for the TC operator given in Proposition 12. 4p "there is a finite sequence of nodes 
xi . . .Xn such that xi is the root, Xj+i the node that comes after Xi in the above ordering, 
for all i, and x„ is the very last node of the tree in the above ordering". 

X -.^ 3u3z{Rz A [TCa^yip] {z, u) A ^3u{u / n' A [TC^yip] {u, u'))) □ 

Theorem 6.7. For A G {MSO, FO(TC^), FO(LFP^)}, the set of axioms Tl-TlO together 
with all A-instances of the induction scheme Lnd defines the class of finite trees. 

Proof. By Proposition 16.61 we can express in A by means of some formula x that a structure 
is a finite tree. So x is provable in h^*^^ (as it is a A-formula valid on the class of finite 
trees). In other words, if F is the set of axioms Tl-TlO together with all A-instances of the 
induction scheme Ind, then we have that T \-\ x- D 
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7. Finite Linear Orders 

Let us note that a simplified version of tfiis metliod can be used in order to sliow the 
completeness of MSO, FO(TC^) and FO(LFP^) on finite node-labeled linear orders (i.e., 
finite node- labeled trees in which every node has at most one child). The relevant simpler 
axioms are the ones listed in Figures [H [7] and respectively, Figures [21 [3] and |H 



LL 


yxyz{x < y /\y < z ^ X < z) 


< is transitive 


L2. 


-i3a;(a; < x) 


< is irreflexive 


L3. 


Vxy(x < y 3z(x <ch z ^ z < y)) 


immediate children 


L4. 


3x^y^{y < x) 


there is a root 


L5. 


\/xy{x = yVx<yVy<x) 


< is total 


Ind. 


Vx(Vy((x < y — )• ^{y)) — > ^x(p{x) 




where 








ip{x) ranges over A- formulas in one free variable x 




and 







X <ch y is shorthand for x < y A -'3z{z < y A x < z) 

Figure 7: Specific axioms on finite linear orders 



8. Conclusion 

In this paper, taking inspiration from Kees Doets |12] we developed a uniform method for 
obtaining complete axiomatizations of fragments of MSO on finite trees. For that purpose, 
we had to adapt classical tools and notions from finite model theory to the specificities of 
Henkin semantics. The presence of admissible subsets called for some refinements in model 
theoretic constructions such as formation of substructure or disjoint union. Also, we noticed 
that not every Ehrenfeucht-Frai'sse game that has been used for FO(TC^) was suitable to 
use on Henkin-structures. We focused on a game which does not seem to have been used 
previously in the literature. We also established analogues of the FO Feferman-Vaught 
theorem for MSO, FO(TC^) and FO(LFP^) on Henkin-structures (let us recall that related 
work for the case of standard structures can be found in [25j). We considered fusions, a 
particular case of the Feferman-Vaught notion of generalized product and obtained results 
for Henkin-structures which might be interesting to generalize and use in other contexts. 

We applied our method to MSO, FO(TC^) and FO(LFP^), but it would be worth also 
examining other fragments of MSO or logics such as monadic deterministic transitive clo- 
sure logic (FO(DTC^), which was advocated in |30j as particularly relevant in the context 
of applications to model-theoretic syntax) or monadic alternating transitive closure logic 
(FO(ATCi)), see also [6]. 

An important feature of our main completeness argument (the idea of which was bor- 
rowed from Kees Doets) is the way we used the inductive scheme of Figure O Hence, 
extending our approach to another class of finite structures would involve finding a com- 
parable scheme. We also know that we should focus on a logic which is decidable on this 
class, as on finite structures recursive enumerability is equivalent to decidability (as long 
as the model-checking is decidable). This suggests that other natural candidates would be 
fragments of MSO on classes of finite structures with bounded treewidth. 
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Finally, let us notice that MSO is also known to be decidable over infinite trees and 
over linear orders of order type oj. It would be interesting to look for a model-theoretic 
argument which would work on a Henkin model and produce an intended model of one of 
these theories in a way comparable to what we did here or to what Keisler did in |20J . Note 
that related complete axiomatizations of monadic theories of classes of infinite structures 
can be found in [5], |29] and [31j, but that instead of relying on Henkin-semantics, the 
completeness proofs there are based on automata-theoretic techniques. 
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